ADP_DTM_DIM.Actividades

The dataset examined has the following dimensions:

Feature Result
Number of observations 419
Number of variables 4

Checks performed

The following variable checks were performed, depending on the data type of each variable:

Check                                                character  factor  labelled  haven labelled  numeric  integer  logical  Date
Identify miscoded missing values                     ×          ×       ×         ×               ×        ×                 ×
Identify prefixed and suffixed whitespace            ×          ×       ×         ×
Identify levels with < 6 obs.                        ×          ×       ×         ×
Identify case issues                                 ×          ×       ×         ×
Identify misclassified numeric or integer variables  ×          ×       ×         ×
Identify outliers                                                                                 ×        ×                 ×

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

Variable       Variable class  # unique values  Missing observations  Any problems?
SkIdActividad  numeric         419              0.00 %                ×
SkIdEmpresa    integer         1                0.00 %                ×
Descripcion    character       282              0.00 %                ×
Orden          integer         210              0.00 %

Variable list

SkIdActividad

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 419
Median 1004355731714487
1st and 3rd quartiles 1002033041407545; 1006724442059458
Min. and max. 1004758460013; 1008861420467332

  • Note that the following possible outlier values were detected: "1004758460013", "10038346358630", "10038443305114", "10044414783636", "10050262658754", …, "100830440010135", "100855018116287", "100863072203575", "100864418640804", "100867322653500" (47 values omitted).

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.
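
A variable that takes a single value, as SkIdEmpresa does here, cannot distinguish rows within the table. A minimal sketch of how such columns could be listed in base R follows; the data frame `df_demo` is made up for illustration and is not the dataset examined.

```r
# Illustrative data only: one constant column and one varying column.
df_demo <- data.frame(SkIdEmpresa = rep(100L, 4), Orden = 0:3)

# A column is constant when it has exactly one distinct non-missing value.
is_constant <- vapply(
  df_demo,
  function(x) length(unique(x[!is.na(x)])) == 1L,
  logical(1)
)

constant_cols <- names(df_demo)[is_constant]
print(constant_cols)
```

Whether a constant column should be dropped depends on its role; a company key such as this one may still be needed for joins across tables.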

Descripcion

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 282
Mode “Concreto pobre afinado”

  • Note that the following levels have at most five observations: "02-CIM-000 Inicio cimentación profunda", "1.01.000 Licencia de Construcción", "1.01.001 Localización y replanteo", "1.01.010 Aseo y limpieza de lote", "1.01.015 - 23 Tala de árboles, traslado y compensación", …, "Zona 3 de -19.92 a -23.12", "Zona 4 de -16.72 a -19.92", "Zona 4 de -19.92 a -23.12", "Zona 5 de -16.72 a -19.92", "Zona 5 de -19.92 a -23.12" (266 values omitted).
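
The "levels with < 6 obs." check reports values that occur at most five times. A rough base R equivalent is a frequency table filtered at that threshold; the vector `descripcion_demo` and its counts below are hypothetical, with only the mode taken from the report.

```r
# Hypothetical values: one frequent level and two rare ones.
descripcion_demo <- c(
  rep("Concreto pobre afinado", 7),
  rep("1.01.000 Licencia de Construcción", 1),
  rep("1.01.001 Localización y replanteo", 2)
)

# Count occurrences per level and keep those seen at most five times.
freq <- table(descripcion_demo)
rare_levels <- names(freq)[freq <= 5]
print(rare_levels)
```

For a free-text description column most levels are expected to be rare, so this note is informative mainly for variables meant to be categorical.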

Orden

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 210
Median 104
1st and 3rd quartiles 52; 156.5
Min. and max. 0; 209


Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:16:10

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)
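
The `nombre` and `df` arguments in the call above suggest that one report is produced per table inside a loop. The sketch below shows one way such a loop might look; `tablas`, `report_path` and `run_reports` are hypothetical names that do not appear in the original script, and rendering is switched off by default so the code runs without dataMaid or pandoc installed.

```r
# Hypothetical helper: reproduces the `file` argument of the call above.
report_path <- function(nombre, carpeta = "20251102") {
  paste0(carpeta, "/reporte_", nombre, ".html")
}

# Hypothetical input: a named list with one data frame per table.
tablas <- list(
  "ADP_DTM_DIM.Empresa" = data.frame(SkIdEmpresa = 100L, Ref_IdEmpresa = 1L)
)

# Assumption: set to TRUE only where dataMaid and its dependencies are available.
run_reports <- FALSE

rutas <- character(0)
for (nombre in names(tablas)) {
  df <- tablas[[nombre]]
  ruta <- report_path(nombre)
  rutas <- c(rutas, ruta)
  if (run_reports && requireNamespace("dataMaid", quietly = TRUE)) {
    dir.create(dirname(ruta), showWarnings = FALSE, recursive = TRUE)
    dataMaid::makeDataReport(data = df, output = "html", file = ruta,
                             replace = TRUE, openResult = FALSE)
  }
}
print(rutas)
```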


ADP_DTM_DIM.Bodega

The dataset examined has the following dimensions:

Feature Result
Number of observations 92153
Number of variables 4

Summary table

Variable      Variable class  # unique values  Missing observations  Any problems?
SkIdBodega    integer         193              0.00 %                ×
SkIdEmpresa   integer         1                0.00 %                ×
CodigoBodega  integer         1                0.00 %                ×
Descripcion   character       3                0.00 %

Variable list

SkIdBodega

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 193
Median 1000165
1st and 3rd quartiles 1000118; 1000224
Min. and max. 10003; 1000295

  • Note that the following possible outlier values were detected: "10003", "10005", "10006", "10007", "10009", …, "100030", "100031", "100034", "100035", "100037" (15 values omitted).

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

CodigoBodega

  • The variable only takes one (non-missing) value: "0". The variable contains 0 % missing observations.

Descripcion

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 3
Mode “Bodega Principal”


Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:16:13

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_DIM.CapituloPresupuesto

The dataset examined has the following dimensions:

Feature Result
Number of observations 2007
Number of variables 8

Summary table

Variable              Variable class  # unique values  Missing observations  Any problems?
SkIdCapitulo          integer         2007             0.00 %                ×
SkIdEmpresa           integer         1                0.00 %                ×
Codigo.Proyecto       integer         88               0.00 %                ×
Capitulo.Numero       character       80               0.00 %                ×
Capitulo.Descripcion  character       192              0.00 %                ×
Tipo.Costo            character       4                0.00 %                ×
Tipo.Costo.Orden      integer         4                0.00 %                ×
Empresa               character       1                0.00 %                ×

Variable list

SkIdCapitulo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 2007
Median 1002042714
1st and 3rd quartiles 1001161221.5; 1002423557.5
Min. and max. 100346; 1002954543

  • Note that the following possible outlier values were detected: "100346", "100347", "100348", "100349", "100350", …, "1002954539", "1002954540", "1002954541", "1002954542", "1002954543" (738 values omitted).

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

Codigo.Proyecto

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 88
Median 204
1st and 3rd quartiles 116; 242
Min. and max. 3; 295

  • Note that the following possible outlier values were detected: "276", "277", "278", "279", "280", "281", "283", "288", "294", "295".
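
For comparison, the sketch below computes classic Tukey fences (1.5 × IQR) from the quartiles reported above and tests the flagged values against them. This is a textbook rule swapped in for illustration, not necessarily the criterion the report applies.

```r
# Quartiles of Codigo.Proyecto as reported above.
q1 <- 116
q3 <- 242

# Classic Tukey fences: 1.5 interquartile ranges beyond each quartile.
iqr <- q3 - q1
lower <- q1 - 1.5 * iqr
upper <- q3 + 1.5 * iqr

# Values flagged by the report.
flagged <- c(276, 277, 278, 279, 280, 281, 283, 288, 294, 295)
outside <- flagged[flagged < lower | flagged > upper]

print(c(lower = lower, upper = upper))
print(length(outside))
```

None of the flagged values falls outside these fences, so the report's outlier check evidently uses a different criterion. For a project code, which is an identifier rather than a measurement, the flagged values are best read as candidates for review rather than as errors.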

Capitulo.Numero

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 80
Mode “37”

  • The following suspected missing value codes enter as regular values: "8", "9".

  • Note that the following levels have at most five observations: "01", "01 00 00", "02", "02 00 00", "03 00 00", …, "50", "55", "60", "60 00 00", "CI" (33 values omitted).


Capitulo.Descripcion

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 192
Mode “IMPREVISTOS”

  • Note that the following levels have at most five observations: "ABERTURAS Y FACHADAS", "ACABADOS", "ACABADOS DE PISO", "ACABADOS EN MUROS", "ACERO DE REFUERZO", …, "URBANISMOS", "UTILIDAD", "VENTANERIA Y FACHADAS", "VENTANERIAS", "VIGILANCIA" (135 values omitted).

  • Note that there might be case problems with the following levels: "Equipos y herramientas", "EQUIPOS Y HERRAMIENTAS", "Estructura", "ESTRUCTURA", "Pañetes", "PAÑETES", "Pintura", "PINTURA", "Preliminares", "PRELIMINARES".
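
Case problems of this kind can be located by grouping values on a case-folded key and keeping the keys that map to more than one spelling. A minimal base R sketch, using a few of the levels listed above in a made-up vector `descripcion_demo`:

```r
# A handful of the levels named in the note above.
descripcion_demo <- c("Estructura", "ESTRUCTURA", "Pintura", "PINTURA", "IMPREVISTOS")

# Number of distinct spellings per upper-cased key.
variantes <- tapply(descripcion_demo, toupper(descripcion_demo),
                    function(x) length(unique(x)))

# Keys with more than one spelling are the candidates for harmonisation.
con_problemas <- names(variantes)[variantes > 1]
print(con_problemas)
```

Which spelling to keep is a decision for the data owner; the sketch only identifies the groups.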


Tipo.Costo

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 4
Mode “COSTOS DIRECTOS”

  • Note that the following levels have at most five observations: "COSTO DIRECTO".

Tipo.Costo.Orden

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 4
Mode “1”
Reference category 0

  • Note that the following levels have at most five observations: "54".

Empresa

  • The variable only takes one (non-missing) value: "ARPRO ARQUITECTOS INGENIEROS S.A.S". The variable contains 0 % missing observations.

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:16:16

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_DIM.ControlClaseOrigen

The dataset examined has the following dimensions:

Feature Result
Number of observations 36
Number of variables 5

Summary table

Variable            Variable class  # unique values  Missing observations  Any problems?
SkIdClaseOrigen     integer         36               0.00 %
Clase               character       7                0.00 %                ×
Clase.Descripcion   character       6                0.00 %                ×
Origen              character       21               0.00 %                ×
Origen.Descripcion  character       32               0.00 %                ×

Variable list

SkIdClaseOrigen

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 36
Median 18.5
1st and 3rd quartiles 9.75; 28.25
Min. and max. 1; 38


Clase

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 7
Mode “I”

  • Note that the following levels have at most five observations: "B", "J", "P", "Y".

Clase.Descripcion

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 6
Mode “Invertido”

  • Note that the following levels have at most five observations: "Ejecutado", "Presupuestado", "Proyectado".

Origen

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 21
Mode “C”

  • The following suspected missing value codes enter as regular values: "".

  • Note that the following levels have at most five observations: "", "C", "D", "E", "ED", …, "TE", "TS", "V", "X", "Y" (11 values omitted).
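
The empty string is reported as a regular value, which is why the variable shows 0 missing observations. If the empty string does stand for "no origin" (an assumption that should be confirmed with the data owner), it could be recoded as NA before analysis. A minimal sketch with a made-up vector `origen_demo`:

```r
# Illustrative values, including one empty string.
origen_demo <- c("C", "", "D", "TE")

# Recode empty or whitespace-only strings as missing.
origen_limpio <- origen_demo
origen_limpio[trimws(origen_limpio) == ""] <- NA_character_

n_missing <- sum(is.na(origen_limpio))
print(n_missing)
```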


Origen.Descripcion

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 32
Mode “Cuentas Control”

  • The following suspected missing value codes enter as regular values: "".

  • Note that the following levels have at most five observations: "", "Actas Descuento Menor Valor", "Actas Generales", "Actas Por Grupos", "Actas Todo Costo", …, "Transformacion Entradas", "Transformacion Salidas", "Traslados Entradas", "Traslados Salidas", "Valores Comprados" (22 values omitted).


Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:16:20

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_DIM.Empresa

The dataset examined has the following dimensions:

Feature Result
Number of observations 1
Number of variables 6

Summary table

Variable            Variable class  # unique values  Missing observations  Any problems?
SkIdEmpresa         integer         1                0.00 %                ×
NombreEmpresa       character       1                0.00 %                ×
Nit                 integer         1                0.00 %                ×
Direccion           character       1                0.00 %                ×
Ref_IdEmpresa       integer         1                0.00 %                ×
Ref_BdConfServidor  integer         1                0.00 %                ×

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

NombreEmpresa

  • The variable is a key (distinct values for each observation).

  • The variable only takes one (non-missing) value: "ARPRO ARQUITECTOS INGENIEROS S.A.S". The variable contains 0 % missing observations.


Nit

  • The variable only takes one (non-missing) value: "860067697". The variable contains 0 % missing observations.

Direccion

  • The variable is a key (distinct values for each observation).

  • The variable only takes one (non-missing) value: "CRA 19 No 90-10". The variable contains 0 % missing observations.


Ref_IdEmpresa

  • The variable only takes one (non-missing) value: "1". The variable contains 0 % missing observations.

Ref_BdConfServidor

  • The variable only takes one (non-missing) value: "1". The variable contains 0 % missing observations.

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:16:23

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_DIM.EspecicficacionDePedidos

The dataset examined has the following dimensions:

Feature Result
Number of observations 111611
Number of variables 5

Summary table

Variable                Variable class  # unique values  Missing observations  Any problems?
SkIdEmpresa             integer         1                0.00 %                ×
SkIdPedido              integer         111611           0.00 %                ×
Codigo.Orden.De.Compra  numeric         23817            20.49 %               ×
Pedido.Urgente          character       2                0.00 %
Tipo.Pedido             character       2                0.00 %

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdPedido

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 111611
Median 10080194
1st and 3rd quartiles 10035733.5; 100111863.5
Min. and max. 100108; 100141328

  • Note that the following possible outlier values were detected: "100108", "100118", "100119", "100120", "100121", …, "1009991", "1009992", "1009993", "1009995", "1009996" (7971 values omitted).

Codigo.Orden.De.Compra

Feature Result
Variable type numeric
Number of missing obs. 22867 (20.49 %)
Number of unique values 23816
Median 16700167.5
1st and 3rd quartiles 350482.75; 22500065.25
Min. and max. 30083; 29500001

  • Note that the following possible outlier values were detected: "27700001", "27700002", "27700003", "27700004", "27700005", …, "28300003", "28300004", "28300005", "29400001", "29500001" (62 values omitted).
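
The missing percentage reported for Codigo.Orden.De.Compra follows directly from the counts given above. As a quick check in base R (the variable names are chosen here for illustration):

```r
# Counts taken from the report: observations in the table and missing values.
n_obs <- 111611
n_missing <- 22867

# Share of missing observations, rounded to 2 decimals as in the report.
pct_missing <- round(100 * n_missing / n_obs, 2)
print(pct_missing)  # 20.49
```

Roughly one order in five therefore has no purchase-order code, which matters for any join on this column.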

Pedido.Urgente

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “NO”


Tipo.Pedido

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “ADICIONAL”


Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:16:26

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_DIM.EspecificacionDeActas

The dataset examined has the following dimensions:

Feature Result
Number of observations 55896
Number of variables 6

Summary table

Variable                 Variable class  # unique values  Missing observations  Any problems?
SkIdEmpresa              integer         1                0.00 %                ×
SkIdEspecificacionActas  numeric         55896            0.00 %                ×
No.Acta                  integer         385              0.00 %                ×
No.Contrato              integer         11407            0.00 %                ×
No.Factura               character       43507            0.00 %                ×
Codigo.de.barras         integer         55896            0.00 %

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdEspecificacionActas

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 55896
Median 10021021002376.5
1st and 3rd quartiles 10010810801223.8; 10027627600024.2
Min. and max. 1003300011; 1002492490076239

  • Note that the following possible outlier values were detected: "1003300011", "1003300012", "1003300021", "1003300022", "1003300023", …, "1002492490076235", "1002492490076236", "1002492490076237", "1002492490076238", "1002492490076239" (26353 values omitted).

No.Acta

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 385
Median 4
1st and 3rd quartiles 2; 11
Min. and max. 1; 385

  • Note that the following possible outlier values were detected: "79", "80", "81", "82", "83", …, "381", "382", "383", "384", "385" (297 values omitted).

No.Contrato

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 11407
Median 2000297.5
1st and 3rd quartiles 1080095; 2280421
Min. and max. 30001; 2940002

  • Note that the following possible outlier values were detected: "2490001", "2490002", "2490003", "2490004", "2490005", …, "2880098", "2880100", "2880102", "2940001", "2940002" (1738 values omitted).

No.Factura

Feature Result
Variable type character
Number of missing obs. 1 (0 %)
Number of unique values 43506
Mode “”

  • The following suspected missing value codes enter as regular values: "", "8", "88", "888", "9", "99", "999".

  • The following values appear with prefixed or suffixed white space: "112873035 ", "21 ", "63919687 ", "71682466 ", "A2431 ", …, "RO382 ", "RO560 ", "TRAY42 ", "VSFE1437 ", "VSFE740 " (27 values omitted).

  • Note that the following levels have at most five observations: " 01103", " 0122458", " 10209", " 105412587", " 105415234", …, "ZA82", "ZA84", "ZA9", "ZC1740", "ZC2204" (43195 values omitted).

  • Note that there might be case problems with the following levels: "Ajuste", "AJUSTE", "anulada", "ANULADA", "anulado", …, "fe114", "FE114", "no pago", "No Pago", "NO PAGO" (15 values omitted).
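
The whitespace and case notes above describe values that differ only in padding or capitalisation. One possible normalisation, sketched in base R on a few of the listed values, trims surrounding whitespace and folds to upper case. Folding to upper case is an assumption made here; it is only appropriate if invoice numbers are not case-sensitive in the source system.

```r
# A few of the values listed above, plus one lower/upper-case pair.
factura_demo <- c("21 ", "A2431 ", "RO382 ", "fe114", "FE114")

# Remove leading/trailing whitespace, then fold case.
factura_limpia <- toupper(trimws(factura_demo))

# Distinct values before and after normalisation.
print(length(unique(factura_demo)))
print(length(unique(factura_limpia)))
```

Free-text entries such as "ANULADA" or "NO PAGO" are not invoice numbers at all and would need separate handling.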


Codigo.de.barras

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 55896
Median 49050
1st and 3rd quartiles 20845.25; 65503.25
Min. and max. 49; 86400


Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:16:32

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_DIM.EspecificacionDeContratos

The dataset examined has the following dimensions:

Feature Result
Number of observations 11950
Number of variables 10

Summary table

Variable             Variable class  # unique values  Missing observations  Any problems?
SkIdEmpresa          integer         1                0.00 %                ×
SkIdContrato         numeric         11950            0.00 %
No..Contrato         integer         11950            0.00 %                ×
Descripcion          character       9769             0.00 %                ×
Formas.de.pago       character       1169             0.00 %                ×
Clase.Contrato       character       3                0.00 %                ×
Fecha.de.creacion    character       3452             0.00 %                ×
Usuario.de.creacion  character       97               0.00 %                ×
Fecha.Inicio         character       3475             0.00 %                ×
Fecha.Fin            character       3001             0.00 %                ×

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdContrato

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 11950
Median 1001881880267.5
1st and 3rd quartiles 10035350308.25; 1002282280720.75
Min. and max. 100330001; 1002952950003


No..Contrato

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 11950
Median 1880267.5
1st and 3rd quartiles 350308.25; 2280720.75
Min. and max. 30001; 2950003

  • Note that the following possible outlier values were detected: "2800001", "2800002", "2800003", "2800004", "2800005", …, "2940002", "2940003", "2950001", "2950002", "2950003" (247 values omitted).

Descripcion

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 9769
Mode “ACARREOS URBANOS”

  • The following suspected missing value codes enter as regular values: "".

  • The following values appear with prefixed or suffixed white space: " ALQUILER DE BAÑO PORTATIL", " Alquiler de equipo etapa 2", " Alquiler Retroexcavadora E42 para construcción de plataformas de trabajo ", " Comisión de topografía", " CONSOLIDACION Y MAMPOSTERIA PAÑETES Y VANOS ANEXIDADES – CLAUSTRO – TEMPLO - CLUB", …, "VISITA TECNICA TRANSFORMADORES ", "Visitas de geotecnia ", "Visitas de Geotecnista ", "Visitas de topografía ", "volante, pasa calles, eventos " (1713 values omitted).

  • Note that the following levels have at most five observations: "- Actualización de la cimentación de la Torre A (Interior 11) de acuerdo al levantamiento topográfico elaborado por la obra.\n- Diseño de tanque de agu", " ALQUILER DE BAÑO PORTATIL", " Alquiler de equipo etapa 2", " Alquiler Retroexcavadora E42 para construcción de plataformas de trabajo ", " Comisión de topografía", …, "VISITAS TECNICAS (ASESORIA ESTRUCTURAL)", "VOLADURAS CONTROLADAS CIMENTACION EXISTENTE", "volante, pasa calles, eventos ", "Volquetas retiro de material sobrante (escombro)", "Workstation Preci 3581 Int Ci7 13700h 16g/1ts W11" (9676 values omitted).

  • Note that there might be case problems with the following levels: "Acarreos obra ", "ACARREOS OBRA ", "Acarreos urbanos", "Acarreos Urbanos", "ACARREOS URBANOS", …, "Transporte y disposición de residuos", "TRANSPORTE Y DISPOSICIÓN DE RESIDUOS", "Transportes urbanos", "Transportes Urbanos", "TRANSPORTES URBANOS" (567 values omitted).


Formas.de.pago

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 1169
Mode “”

  • The following suspected missing value codes enter as regular values: "".

  • The following values appear with prefixed or suffixed white space: " 50% material en obra, saldo 50% ontra entrega", " actas parciales de acuerdo ala avance de obra", " actas segun avance de obra", " anticipo y actas catorcenal", " Cortes de Obra ", …, "un solo corte ", "UN SOLO CORTE ", "Una sola vez ", "UNICA VEZ ", "unico pago " (174 values omitted).

  • Note that the following levels have at most five observations: " 50% material en obra, saldo 50% ontra entrega", " actas parciales de acuerdo ala avance de obra", " actas segun avance de obra", " anticipo y actas catorcenal", " Cortes de Obra ", …, "unico Pago", "Unico pago", "UNICO PAGO", "unico pago ", "UNICO POR CONTRATO" (1022 values omitted).

  • Note that there might be case problems with the following levels: "10% anticipo; cortes quincenales", "10% Anticipo; cortes quincenales", "100% anticipado", "100% Anticipado", "100% contraentrega", …, "unico pago", "unico Pago", "Unico pago", "Unico Pago", "UNICO PAGO" (377 values omitted).


Clase.Contrato

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 3
Mode “GENERALES”

  • Note that the following levels have at most five observations: "TODO COSTO".

Fecha.de.creacion

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 3452
Mode “17/03/2015”

  • Note that the following levels have at most five observations: "01/02/2012", "01/02/2013", "01/02/2016", "01/02/2018", "01/02/2020", …, "31/10/2021", "31/10/2022", "31/10/2023", "31/10/2024", "31/10/2025" (2879 values omitted).

Usuario.de.creacion

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 97
Mode “”

  • The following suspected missing value codes enter as regular values: "".

  • The following values appear with prefixed or suffixed white space: "Ana Maria Leon ", "Juan Felipe Murillo ", "Maria Mercedes Arias ", "William Alfredo Fernandez Leon ".

  • Note that the following levels have at most five observations: "Agustin Bolivar", "Andrés Camilo Montañez", "Cesar David Sotaquira", "Daniel Alejandro Viana", "Edgar Joaquín Ríos", …, "Maria Angelica Oliva", "Mauricio Lemus", "Miguel Matamala", "Natalia Moreno", "Tania Alejandra Acevedo Barriga" (8 values omitted).


Fecha.Inicio

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 3475
Mode “01/10/2024”

  • Note that the following levels have at most five observations: "01/01/1900", "01/01/2011", "01/01/2013", "01/01/2014", "01/01/2019", …, "31/10/2014", "31/10/2017", "31/10/2020", "31/10/2024", "31/12/2024" (2939 values omitted).

Fecha.Fin

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 3001
Mode “31/12/2024”

  • Note that the following levels have at most five observations: "01/01/1900", "01/01/2013", "01/01/2014", "01/01/2018", "01/01/2021", …, "31/10/2027", "31/10/2028", "31/12/2013", "31/12/2027", "31/12/2029" (2660 values omitted).
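
The three date variables are stored as character strings in day/month/year form, so the report checks them as categorical levels rather than as dates. A minimal sketch of converting them in base R follows. Treating "01/01/1900" as a placeholder for an unknown date is an assumption, suggested only by its appearance in both Fecha.Inicio and Fecha.Fin; the vector `fecha_demo` is illustrative.

```r
# Values taken from the levels and modes listed above.
fecha_demo <- c("01/10/2024", "01/01/1900", "31/12/2024")

# Parse day/month/year strings into Date objects.
fechas <- as.Date(fecha_demo, format = "%d/%m/%Y")

# Assumption: 1900-01-01 is a sentinel for "no date" and is recoded as NA.
fechas[which(fechas == as.Date("1900-01-01"))] <- NA

print(fechas)
```

Once converted, the variables can be re-run through the report as Date columns, where the outlier check applies.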

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:19:52

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_DIM.EspecificacionDeEntradasAlmacen

The dataset examined has the following dimensions:

Feature Result
Number of observations 63318
Number of variables 6

Summary table

Variable                           Variable class  # unique values  Missing observations  Any problems?
SkIdEmpresa                        integer         1                0.00 %                ×
SkIdEspecificacionEntradasAlmacen  numeric         63302            0.00 %                ×
No.Entrada                         integer         63302            0.00 %
Remision                           character       49501            0.00 %                ×
No.Factura                         character       48192            0.00 %                ×
Codigo.de.barras                   integer         63318            0.00 %                ×

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdEspecificacionEntradasAlmacen

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 63302
Median 1002282280270.5
1st and 3rd quartiles 1001081080355.25; 10021721700443.8
Min. and max. 100330001; 10029429400005

  • Note that the following possible outlier values were detected: "100330001", "100330002", "100330003", "100330004", "100330005", …, "10035352196", "10035352197", "10035352198", "10035352199", "10035352200" (15500 values omitted).

No.Entrada

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 63302
Median 2280270.5
1st and 3rd quartiles 1080355.25; 21700443.75
Min. and max. 30001; 29400005


Remision

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 49501
Mode “”

  • The following suspected missing value codes enter as regular values: "", ".", ".215141 ", ".30265", ".4024", "8", "88", "9", "99", "9999".

  • The following values appear with prefixed or suffixed white space: " 209164", ".215141 ", "43-00003022 ".

  • Note that the following levels have at most five observations: " 209164", ".", ".215141 ", ".30265", ".4024", …, "WHS 151728", "WHS 152294", "WHS141391", "WHS142622", "WHS144190" (49390 values omitted).


No.Factura

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 48192
Mode “”

  • The following suspected missing value codes enter as regular values: "", "88", "99".

  • The following values appear with prefixed or suffixed white space: " 1012116280", " CRE 23870", " F290-00117092", " F33070", " F33537", …, "FACT148173 ", "PR259-17 ", "PR281-17 ", "RI2399839 ", "RI29348 " (14 values omitted).

  • Note that the following levels have at most five observations: " 1012116280", " CRE 23870", " F290-00117092", " F33070", " F33537", …, "X2621041243", "X2651025554", "X2742507339", "X2861025569", "YBE887875" (47741 values omitted).

  • Note that there might be case problems with the following levels: "22v218042", "22V218042", "f333", "F333", "f5602", …, "FIG083470", "pf46794", "PF46794", "Toc15884", "TOC15884" (8 values omitted).


Codigo.de.barras

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 63318
Median 142240.5
1st and 3rd quartiles 119185.5; 158829.75
Min. and max. 118; 175079

  • Note that the following possible outlier values were detected: "118", "119", "120", "121", "122", …, "1994", "1995", "1996", "1997", "1999" (906 values omitted).

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:20:17

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_DIM.EspecificacionEjecucionCliente

The dataset examined has the following dimensions:

Feature Result
Number of observations 29
Number of variables 4

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdEmpresa integer 1 0.00 % ×
SkIdEspecificacionEjecucionCliente numeric 29 0.00 % ×
NoActaCliente integer 26 0.00 % ×
ContratoCliente character 2 0.00 % ×

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdEspecificacionEjecucionCliente

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 29
Median 10018812
1st and 3rd quartiles 1001885; 10018819
Min. and max. 10061; 10027527500003

  • Note that the following possible outlier values were detected: "10027527500001", "10027527500002", "10027527500003".

NoActaCliente

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 26
Median 12
1st and 3rd quartiles 5; 19
Min. and max. 1; 27500003

  • Note that the following possible outlier values were detected: "27500001", "27500002", "27500003".

ContratoCliente

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “”

  • The following suspected missing value codes enter as regular values: "".

  • Note that the following levels have at most five observations: "27/08/2024".


Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:27:16



ADP_DTM_DIM.EstadoEnvioDocumento

The dataset examined has the following dimensions:

Feature Result
Number of observations 3
Number of variables 2

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdEstadoEnvioDocumento integer 3 0.00 % ×
Descripcion character 3 0.00 % ×

Variable list

SkIdEstadoEnvioDocumento

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 3
Mode “-1”
Reference category -1

  • Note that the following levels have at most five observations: "-1", "0", "1".

Descripcion

  • The variable is a key (distinct values for each observation).

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:27:19



ADP_DTM_DIM.EstadoPorDocumento

The dataset examined has the following dimensions:

Feature Result
Number of observations 70
Number of variables 6

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdEstadoPorDocumento integer 70 0.00 % ×
SkIdEmpresa integer 1 0.00 % ×
SkIdEstado integer 21 0.00 % ×
Descripcion.Estado character 50 0.00 % ×
Tipo.Documento character 14 0.00 % ×
Empresa character 1 0.00 % ×

Variable list

SkIdEstadoPorDocumento

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 70
Median 10072.5
1st and 3rd quartiles 10026.25; 100122.5
Min. and max. -100111; 1006200006

  • Note that the following possible outlier values were detected: "-100111", "-100101", "-10076", "-10075", "-10072", …, "1006200002", "1006200003", "1006200004", "1006200005", "1006200006" (2 values omitted).

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdEstado

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 21
Median 2
1st and 3rd quartiles 0; 4
Min. and max. -6; 200006

  • Note that the following possible outlier values were detected: "-6", "-5", "200001", "200002", "200003", "200004", "200005", "200006".

Descripcion.Estado

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 50
Mode “APROBADO”

  • The following values appear with prefixed or suffixed white space: "Por Aprobación ", "Por Preaprobación ".

  • Note that the following levels have at most five observations: "Abierto", "AJUSTES GENERADOS", "Anulada", "APROBACIÓN DE ACTAS", "Aprobada", …, "RECHAZADO INTERVENTOR", "RECHAZADO TÉCNICO", "SOLICITADA", "SOLICITADO", "TÉCNICO" (40 values omitted).

  • Note that there might be case problems with the following levels: "Aprobada", "APROBADA", "Aprobado", "APROBADO", "Cerrado", …, "NO PAGO", "Programada", "PROGRAMADA", "Programado", "PROGRAMADO" (4 values omitted).
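
Case problems of this kind can be listed systematically by grouping labels on a normalised key. The snippet below is a minimal Python sketch, not the check dataMaid itself runs. It assumes that labels differing only by letter case or surrounding whitespace are meant to be the same status, and the function name `case_variant_groups` is hypothetical.

```python
from collections import defaultdict

def case_variant_groups(labels):
    """Group labels that differ only by letter case or surrounding whitespace."""
    groups = defaultdict(set)
    for label in labels:
        groups[label.strip().casefold()].add(label)
    # Keep only normalised keys that occur with more than one spelling.
    return {key: sorted(variants) for key, variants in groups.items() if len(variants) > 1}

# A few labels taken from the findings above.
labels = ["Aprobada", "APROBADA", "Aprobado", "APROBADO", "Por Aprobación ", "Cerrado"]
groups = case_variant_groups(labels)
```

Here `groups` holds one entry for "aprobada" and one for "aprobado", each with its two spellings. Note that the sketch does not merge the feminine and masculine forms, which may or may not denote the same state.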


Tipo.Documento

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 14
Mode “PEDIDOS”

  • Note that the following levels have at most five observations: "ANTICIPOS", "CONTRATOS", "DEVOLUCIONES", "EJECUCION CLIENTE", "EJECUCION ESTANDAR", …, "NOTAS EN VALOR", "POLIZAS DE CONTRATOS", "PROYECCION", "SALIDAS DE ALMACEN", "TRASLADOS DE ALMACEN" (1 value omitted).

Empresa

  • The variable only takes one (non-missing) value: "ARPRO ARQUITECTOS INGENIEROS S.A.S". The variable contains 0 % missing observations.

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:27:22



ADP_DTM_DIM.Fecha

The dataset examined has the following dimensions:

Feature Result
Number of observations 22645
Number of variables 14

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdFecha integer 22645 0.00 % ×
Fecha character 22645 0.00 % ×
Año integer 62 0.00 % ×
Mes integer 12 0.00 %
Dia integer 31 0.00 %
DiaDelAño integer 366 0.00 %
SemanaDelAño integer 54 0.00 %
Trimestre integer 4 0.00 %
Semestre integer 2 0.00 %
NombreMes character 12 0.00 %
NombreMesCorto character 12 0.00 %
NombreDia character 7 0.00 %
NombreDiaCorto character 7 0.00 %
MesAño character 732 0.00 %

Variable list

SkIdFecha

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 22645
Median 20200101
1st and 3rd quartiles 20040702; 20350702
Min. and max. 19000101; 20501231

  • Note that the following possible outlier values were detected: "19000101", "19000102", "19000103", "19000104", "19000105", …, "19001227", "19001228", "19001229", "19001230", "19001231" (355 values omitted).

Fecha

  • The variable is a key (distinct values for each observation).

Año

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 62
Median 2020
1st and 3rd quartiles 2004; 2035
Min. and max. 1900; 2050

  • Note that the following possible outlier values were detected: "1900".
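
The row count of this date dimension can be cross-checked with a short calculation. The figures above (62 distinct years, minimum 1900, maximum 2050, 1900 as the only flagged year) are consistent with the table holding the whole of 1900, presumably a placeholder year, plus a contiguous block from 1990 to 2050. That layout is an inference, not something the report states. Under that assumption, this minimal Python sketch rebuilds the yyyymmdd keys and compares them with the reported totals. The function names are hypothetical.

```python
import statistics
from datetime import date, timedelta

def days_in_years(first_year: int, last_year: int) -> int:
    """Number of calendar days from 1 Jan first_year to 31 Dec last_year."""
    return (date(last_year, 12, 31) - date(first_year, 1, 1)).days + 1

def date_to_key(d: date) -> int:
    """Encode a date as a yyyymmdd integer, the form SkIdFecha appears to use."""
    return d.year * 10000 + d.month * 100 + d.day

def year_block(first_year: int, last_year: int) -> list:
    """All yyyymmdd keys for the given inclusive range of years, in order."""
    start = date(first_year, 1, 1)
    n_days = days_in_years(first_year, last_year)
    return [date_to_key(start + timedelta(days=i)) for i in range(n_days)]

# Assumed layout: the year 1900 plus a contiguous 1990-2050 block.
total_rows = days_in_years(1900, 1900) + days_in_years(1990, 2050)
keys = year_block(1900, 1900) + year_block(1990, 2050)
median_key = statistics.median(keys)
```

Under this assumption `total_rows` is 365 + 22280 = 22645 and `median_key` is 20200101, both matching the report. The 365 flagged values of SkIdFecha would then simply be the 1900 block, which sits 90 years away from the rest.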

Mes

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 12
Median 7
1st and 3rd quartiles 4; 10
Min. and max. 1; 12


Dia

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 31
Median 16
1st and 3rd quartiles 8; 23
Min. and max. 1; 31


DiaDelAño

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 366
Median 183
1st and 3rd quartiles 92; 274
Min. and max. 1; 366


SemanaDelAño

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 54
Median 27
1st and 3rd quartiles 14; 40
Min. and max. 1; 54


Trimestre

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 4
Mode “3”
Reference category 1


Semestre

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “2”
Reference category 1


NombreMes

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 12
Mode “Agosto”


NombreMesCorto

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 12
Mode “Ago”


NombreDia

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 7
Mode “Lunes”


NombreDiaCorto

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 7
Mode “Lun”


MesAño

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 732
Mode “Ago-00”


Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:27:27



ADP_DTM_DIM.Insumo

The dataset examined has the following dimensions:

Feature Result
Number of observations 19251
Number of variables 24

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdInsumo integer 19251 0.00 % ×
SkIdEmpresa integer 1 0.00 % ×
Empresa character 1 0.00 % ×
Codigo.Insumo integer 19251 0.00 %
Insumo.Descripcion character 19251 0.00 % ×
Agrupacion numeric 287 0.00 % ×
Agrupacion.Descripcion character 287 0.00 % ×
Tipo character 6 0.00 %
Tipo.Descripcion character 6 0.00 %
Unidad character 30 0.00 % ×
Descripcion.Unidad character 30 0.00 % ×
Estado character 1 0.00 % ×
Requiere.Equipo character 1 0.00 % ×
Dias.Reposicion integer 6 0.00 % ×
SubAnalisis character 1 0.00 % ×
Devolutivo character 2 0.00 %
Stock.Maximo integer 1 0.00 % ×
Stock.Minimo integer 1 0.00 % ×
Valor.Unitario numeric 9746 0.00 % ×
Porcentaje.IVA numeric 5 0.00 %
Valor.Neto numeric 10052 0.00 % ×
Fecha.Creacion character 1999 0.00 % ×
Fecha.Modificacion character 1687 0.00 % ×
Codigo.Insumo.Id integer 19251 0.00 %

Variable list

SkIdInsumo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 19251
Median 1009729
1st and 3rd quartiles 1004915.5; 10014543.5
Min. and max. 100101; 10019356

  • Note that the following possible outlier values were detected: "100101", "100102", "100103", "100104", "100105", …, "100995", "100996", "100997", "100998", "100999" (889 values omitted).
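
These flagged values are probably an artefact of how the key is built rather than data errors. The figures for SkIdInsumo (min 100101, median 1009729, max 10019356) line up digit for digit with SkIdEmpresa (100) followed by Codigo.Insumo (min 101, median 9729, max 19356). The minimal Python sketch below assumes the key is such a decimal concatenation, which the report does not state. The function name `composite_key` is hypothetical.

```python
def composite_key(company_id: int, code: int) -> int:
    """Hypothetical key builder: company id followed by the code, read as one integer."""
    return int(f"{company_id}{code}")

# Codes with three, four and five digits, including the reported min, median and max.
codes = [101, 999, 1000, 9729, 19356]
keys = [composite_key(100, c) for c in codes]
```

Each extra digit in the code makes the key roughly ten times larger, so the three-digit codes (keys 100101 to 100999) land far below the bulk of the values and are flagged. If the assumption holds, an outlier check on this column carries no information and the key is better treated as an identifier than as a number.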

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

Empresa

  • The variable only takes one (non-missing) value: "ARPRO ARQUITECTOS INGENIEROS S.A.S". The variable contains 0 % missing observations.

Codigo.Insumo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 19251
Median 9729
1st and 3rd quartiles 4915.5; 14543.5
Min. and max. 101; 19356


Insumo.Descripcion

  • The variable is a key (distinct values for each observation).

Agrupacion

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 287
Median 1301
1st and 3rd quartiles 901; 2504
Min. and max. 101; 9001

  • Note that the following possible outlier values were detected: "101", "102", "103", "104", "105", …, "405", "406", "407", "408", "409" (45 values omitted).

Agrupacion.Descripcion

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 287
Mode “Demás Elementos de Ferretería”

  • Note that the following levels have at most five observations: "Accesorios para redes gases medicinales", "Almacenamiento/Bodegajes", "Anticipo Retención en la Fuente", "Asesorías en Obra", "Beneficios", …, "Rendimientos Financieros", "Retención IVA Régimen Simplificado", "Terreno", "Tinas y Jacuzzies", "Utilidad" (50 values omitted).

Tipo

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 6
Mode “M”


Tipo.Descripcion

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 6
Mode “Materiales”


Unidad

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 30
Mode “un”

  • Note that the following levels have at most five observations: "%", "cm", "gr", "km", "p2", "pl", "pz", "sg", "sm", "tb".

Descripcion.Unidad

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 30
Mode “UNIDAD”

  • Note that the following levels have at most five observations: "CENTIMETRO", "GRAMO", "KILOMETRO", "PIE CUADRADO", "PIE LINEAL", "PIEZA", "PORCENTAJE", "SEMANA", "SUMA GLOBAL", "TAMBOR".

Estado

  • The variable only takes one (non-missing) value: "ACTIVO". The variable contains 0 % missing observations.

Requiere.Equipo

  • The variable only takes one (non-missing) value: "NO". The variable contains 0 % missing observations.

Dias.Reposicion

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 6
Median 0
1st and 3rd quartiles 0; 0
Min. and max. 0; 60

  • Note that the following possible outlier values were detected: "3", "5", "10", "15", "60".
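
When a variable is zero for most rows, as here (median and both quartiles equal 0), any quartile-based rule will flag every non-zero value. The minimal Python sketch below illustrates this with the classic Tukey fences, used as a stand-in because the exact rule dataMaid applies is not shown in this report. The sample is made up: twenty zeros plus the five non-zero values listed above.

```python
import statistics

def tukey_outliers(values, k=1.5):
    """Return the distinct values outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return sorted({v for v in values if v < low or v > high})

# Made-up sample: mostly zeros, plus the non-zero replenishment times from the report.
sample = [0] * 20 + [3, 5, 10, 15, 60]
flagged = tukey_outliers(sample)
```

With an interquartile range of 0 the fences collapse to the single point 0, so all five non-zero values are flagged. For a column like this, the share of non-zero rows is likely a more useful summary than the outlier list.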

SubAnalisis

  • The variable only takes one (non-missing) value: "NO". The variable contains 0 % missing observations.

Devolutivo

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “NO”


Stock.Maximo

  • The variable only takes one (non-missing) value: "0". The variable contains 0 % missing observations.

Stock.Minimo

  • The variable only takes one (non-missing) value: "0". The variable contains 0 % missing observations.

Valor.Unitario

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 9746
Median 26010.34
1st and 3rd quartiles 60; 233122.55
Min. and max. 0; 9730090057.86

  • Note that the following possible outlier values were detected: "4387255.17", "4400000", "4400574.71", "4408000", "4424125", …, "3117233658.91", "3.2e+09", "4277358913.26", "7316435173", "9730090057.86" (859 values omitted).

Porcentaje.IVA

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 5
Mode “19”
Reference category 0


Valor.Neto

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 10052
Median 30000
1st and 3rd quartiles 71.4; 266457.02
Min. and max. 0; 9730090057.86

  • Note that the following possible outlier values were detected: "4961900", "4971322.1", "4973951.5", "4997012.8", "4998000", …, "3159328000", "3.808e+09", "4277358913.26", "7316435173", "9730090057.86" (862 values omitted).
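
Several figures in the Valor.Unitario and Valor.Neto summaries suggest that Valor.Neto is Valor.Unitario plus VAT at the rate in Porcentaje.IVA (mode 19): the first quartiles are 60 and 71.4, and the flagged values 3.2e+09 and 3.808e+09 are in the same 1.19 ratio. This relation is an inference from the summary statistics, not something the report confirms. The minimal Python sketch below shows the arithmetic, and `net_value` is a hypothetical name.

```python
import math

def net_value(unit_value: float, vat_percent: float) -> float:
    """Assumed relation: net value = unit value increased by the VAT percentage."""
    return unit_value * (1 + vat_percent / 100)

# Pairs of figures that appear in the two summaries above.
q1_net = net_value(60, 19)                  # compare with the reported 71.4
large_net = net_value(3.2e9, 19)            # compare with the reported 3.808e+09
max_net = net_value(9730090057.86, 0)       # same maximum in both columns

checks = [
    math.isclose(q1_net, 71.4),
    math.isclose(large_net, 3.808e9),
    max_net == 9730090057.86,
]
```

If the relation holds row by row, the two outlier lists describe the same records and only one of the two columns needs reviewing. Confirming that requires the row-level data, which this report does not include.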

Fecha.Creacion

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 1999
Mode “06/12/2010”

  • Note that the following levels have at most five observations: "01/02/2012", "01/02/2013", "01/02/2016", "01/02/2021", "01/02/2023", …, "31/08/2023", "31/10/2011", "31/10/2013", "31/10/2014", "31/10/2016" (1404 values omitted).

Fecha.Modificacion

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 1687
Mode “”

  • The following suspected missing value codes enter as regular values: "".

  • Note that the following levels have at most five observations: "01/02/2012", "01/03/2013", "01/03/2015", "01/03/2016", "01/04/2015", …, "31/08/2015", "31/08/2023", "31/10/2011", "31/10/2013", "31/10/2014" (1224 values omitted).
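
Both date columns are stored as character strings, and Fecha.Modificacion uses the empty string where no modification was recorded. The following minimal Python sketch converts such values to proper dates. It assumes the format is day/month/year, which the value "31/10/2011" supports, and that an empty string means "never modified". The function name `parse_ddmmyyyy` is hypothetical.

```python
from datetime import date, datetime
from typing import Optional

def parse_ddmmyyyy(text: str) -> Optional[date]:
    """Parse a dd/mm/yyyy string; empty or blank strings become None."""
    text = text.strip()
    if not text:
        return None
    return datetime.strptime(text, "%d/%m/%Y").date()

# Two values from the report plus the empty placeholder.
parsed = [parse_ddmmyyyy(s) for s in ["06/12/2010", "31/10/2011", ""]]
```

Once the columns are parsed, the "at most five observations" notes above become irrelevant, since they only arise because every distinct date is treated as a category level.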


Codigo.Insumo.Id

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 19251
Median 9729
1st and 3rd quartiles 4915.5; 14543.5
Min. and max. 101; 19356


Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:27:35



ADP_DTM_DIM.Items

The dataset examined has the following dimensions:

Feature Result
Number of observations 49311
Number of variables 23

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdItems integer 49311 0.00 % ×
SkIdEmpresa integer 1 0.00 % ×
SkIdAPU integer 37856 0.00 %
SkIdNivel integer 1 0.00 % ×
Empresa character 1 0.00 % ×
Item.No character 21497 0.00 % ×
SubCapitulo character 512 0.00 % ×
Item.Descripcion character 23786 0.00 % ×
Cantidad numeric 11916 0.00 % ×
Valor.Sin.IVA numeric 17985 0.00 % ×
Precio.Venta numeric 476 0.00 % ×
Codigo.Cliente character 367 0.00 % ×
Cantidad.Proyectada numeric 9796 0.00 % ×
Unidad.Medida character 62 0.00 % ×
Item.estado character 3 0.00 %
Metro.cuadrado numeric 3 0.00 %
Aplica.En.Contratos character 2 0.00 %
Aplica.En.Almacen character 2 0.00 %
Bloqueo.De.Contratos.Por.Cantidad character 2 0.00 %
Bloqueo.De.Contratos.Por.Valor character 2 0.00 %
Bloqueo.De.Salidas.Por.Cantidad character 2 0.00 %
Bloqueo.De.Salidas.Por.Valor character 2 0.00 %
Clase.Item logical 1 100.00 % ×

Variable list

SkIdItems

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 49311
Median 10077795
1st and 3rd quartiles 10038965.5; 100106680.5
Min. and max. 1002462; 100145937

  • Note that the following possible outlier values were detected: "1002462", "1002463", "1002464", "1002499", "1002503", …, "1009989", "1009991", "1009992", "1009993", "1009994" (2006 values omitted).

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdAPU

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 37856
Median 2110861
1st and 3rd quartiles 1173627; 226011666.5
Min. and max. 30004; 295021079


SkIdNivel

  • The variable only takes one (non-missing) value: "0". The variable contains 0 % missing observations.

Empresa

  • The variable only takes one (non-missing) value: "ARPRO ARQUITECTOS INGENIEROS S.A.S". The variable contains 0 % missing observations.

Item.No

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 21497
Mode “7.001”

  • The following suspected missing value codes enter as regular values: "".

  • The following values appear with prefixed or suffixed white space: "01 54 00 80 ", "01 56 26 1 ", "01 91 13 1 ", "09 30 13 51 ", "23.704 ", "25.05.801 ", "26 43 13 1 ".

  • Note that the following levels have at most five observations: "|25.04.003", "01 31 13 1.1", "01 31 13 1.2", "01 31 13 10.1", "01 31 13 10.2", …, "9.96", "9.97", "9.98", "9.990", "9.991" (19506 values omitted).

  • Note that there might be case problems with the following levels: "2.01.106.2a", "2.01.106.2A", "2.01.106.2b", "2.01.106.2B", "2.01.106.3a", …, "2.01.106.3B", "2.01.106.4a", "2.01.106.4A", "2.01.106.4b", "2.01.106.4B" (2 values omitted).


SubCapitulo

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 512
Mode “”

  • The following suspected missing value codes enter as regular values: "", "-".

  • The following values appear with prefixed or suffixed white space: " Generales", " Torre 2", " Unidad estructural 8", "Plataforma - E1 ", "Reclamaciones ".

  • Note that the following levels have at most five observations: " Torre 2", "|", "100x100x5 Protección pasiva contra fuego", "200x200x5 Protección pasiva contra fuego", "300x300x1/2 Protección pasiva contra fuego", …, "Washer", "Zapata ARC", "Zapata ARE", "Zarpa muro de contención ARC", "Zonas Comunales" (229 values omitted).

  • Note that there might be case problems with the following levels: "claustro", "Claustro", "Comercio", "COMERCIO", "comunes", …, "Urbanismo Interno", "Vivienda", "VIVIENDA", "Zonas comunales", "Zonas Comunales" (33 values omitted).


Item.Descripcion

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 23786
Mode “Concreto pobre”

  • The following suspected missing value codes enter as regular values: ".", "..".

  • The following values appear with prefixed or suffixed white space: " Administración de obra (Costo reembolsable)", " Ajuste TRM Puertas cortafuego Almacen el Arq", " Banca coworking C-20", " Banca coworking C-21", " CIELO RASO EN LAMINA DE PVC", …, "Vigas de amarres concreto 4000 psi ", "Vigas y viguetas segunda etapa ", "Vinilo sobre pañete ", "Win plástico ", "Zonas duras " (1089 values omitted).

  • Note that the following levels have at most five observations: " Administración de obra (Costo reembolsable)", " Banca coworking C-20", " Banca coworking C-21", " CIELO RASO EN LAMINA DE PVC", " CIELO RASO EN PANELES DE YESO 1/2”+", …, "Zona verde piso 1", "Zonas comunales - BBQ", "Zonas duras ", "Zonas Verdes", "Zorra metálica canecas" (22274 values omitted).

  • Note that there might be case problems with the following levels: "Acero de 60000 psi cimentación", "Acero de 60000 PSI cimentación", "Acero de 60000 psi pilotes", "Acero de 60000 PSI pilotes", "Acero de refuerzo escaleras", …, "Vigas de Cimentación en Concreto", "Win plástico", "Win Plástico", "Zapatas en concreto 3000 psi", "Zapatas en concreto 3000 PSI" (272 values omitted).


Cantidad

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 11916
Median 16.7
1st and 3rd quartiles 1; 157.76
Min. and max. 0; 7225562

  • Note that the following possible outlier values were detected: "2993.17", "2996.36", "2996.9", "2997.55", "3000", …, "1451505.4", "1625491.61", "1821837.73", "2478320.31", "7225562" (1485 values omitted).

Valor.Sin.IVA

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 17985
Median 464231.12
1st and 3rd quartiles 0; 10717500.12
Min. and max. -176105480; 1.8e+11

  • Note that the following possible outlier values were detected: "-176105480", "264097508.81", "264432700.6", "265583403", "265638779.05", …, "15086542242.09", "1.8e+10", "21227074000", "39874619881.97", "1.8e+11" (1149 values omitted).

Precio.Venta

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 476
Median 0
1st and 3rd quartiles 0; 0
Min. and max. 0; 1.4e+09

  • Note that the following possible outlier values were detected: "1", "100", "800", "2081.38", "3081", …, "183851314", "193499918", "203347597", "245550000", "1.4e+09" (465 values omitted).

Codigo.Cliente

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 367
Mode “”

  • The following suspected missing value codes enter as regular values: "".

  • Note that the following levels have at most five observations: "1.01.01", "1.01.02", "1.01.03", "1.01.04", "1.01.05", …, "OC 95", "OC 96", "OC 97", "OC 98", "OC 99" (356 values omitted).


Cantidad.Proyectada

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 9796
Median 0
1st and 3rd quartiles 0; 0
Min. and max. -224187.91; 1435780.16

  • Note that the following possible outlier values were detected: "-224187.91", "-206981.83", "-135627", "-127364.18", "-125373.92", …, "359014.57", "399875.42", "592705.38", "804026.97", "1435780.16" (9785 values omitted).

Unidad.Medida

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 62
Mode “un”

  • The following values appear with prefixed or suffixed white space: "m2 ", "un ", "Un ".

  • Note that the following levels have at most five observations: "%", "0", "dia", "gbl", "glb", …, "ton", "ün", "un ", "Visi", "VJ" (17 values omitted).

  • Note that there might be case problems with the following levels: "di", "DI", "gb", "GB", "gl", …, "Un ", "und", "Und", "vj", "VJ" (29 values omitted).


Item.estado

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 3
Mode “EJECUCIÓN”


Metro.cuadrado

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 3
Mode “0”
Reference category 0


Aplica.En.Contratos

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “SI”


Aplica.En.Almacen

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “SI”


Bloqueo.De.Contratos.Por.Cantidad

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “SI”


Bloqueo.De.Contratos.Por.Valor

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “SI”


Bloqueo.De.Salidas.Por.Cantidad

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “SI”


Bloqueo.De.Salidas.Por.Valor

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “SI”


Clase.Item

  • The variable only takes one value: "NA".

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:27:49



ADP_DTM_DIM.NivelesPresupuesto

The dataset examined has the following dimensions:

Feature Result
Number of observations 0
Number of variables 8

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdNivel logical 0 NaN % ×
SkIdEmpresa logical 0 NaN % ×
Codigo.Proyecto logical 0 NaN % ×
Id.Nivel logical 0 NaN % ×
Descripcion.Nivel logical 0 NaN % ×
Nivel.Auxiliar logical 0 NaN % ×
Orden logical 0 NaN % ×
Empresa logical 0 NaN % ×
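
The "NaN %" entries above are what a missing-share calculation yields when a dataset has no rows: zero missing values divided by zero observations. The following is a minimal Python sketch of that arithmetic, not dataMaid's own code; `missing_share` and `is_key` are hypothetical helper names introduced only for illustration.

```python
import math


def missing_share(values):
    """Percentage of missing entries (None) in a column; NaN for an empty column."""
    n = len(values)
    if n == 0:
        # 0 missing / 0 observations is undefined, so report NaN instead of failing.
        return math.nan
    n_missing = sum(v is None for v in values)
    return 100.0 * n_missing / n


def is_key(values):
    """True when every observation has a distinct value (trivially true with no rows)."""
    return len(set(values)) == len(values)


print(missing_share([]))  # nan
print(is_key([]))         # True
```

With zero rows both results hold trivially, which is consistent with every variable in this dataset being reported as a key. It suggests the empty extract should be checked at the source before any of the per-variable messages are interpreted.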

Variable list

SkIdNivel

  • The variable is a key (distinct values for each observation).

  • The variable only takes one value: "NA".


SkIdEmpresa

  • The variable is a key (distinct values for each observation).

  • The variable only takes one value: "NA".


Codigo.Proyecto

  • The variable is a key (distinct values for each observation).

  • The variable only takes one value: "NA".


Id.Nivel

  • The variable is a key (distinct values for each observation).

  • The variable only takes one value: "NA".


Descripcion.Nivel

  • The variable is a key (distinct values for each observation).

  • The variable only takes one value: "NA".


Nivel.Auxiliar

  • The variable is a key (distinct values for each observation).

  • The variable only takes one value: "NA".


Orden

  • The variable is a key (distinct values for each observation).

  • The variable only takes one value: "NA".


Empresa

  • The variable is a key (distinct values for each observation).

  • The variable only takes one value: "NA".


Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:29:14

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_DIM.OrigenDelDocumento

The dataset examined has the following dimensions:

Feature Result
Number of observations 7
Number of variables 2

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdOrigenDelDocumento integer 7 0.00 %
Descripcion character 7 0.00 % ×

Variable list

SkIdOrigenDelDocumento

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 7
Median 3
1st and 3rd quartiles 1.5; 4.5
Min. and max. 0; 6
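
The quartiles above can be reproduced by hand. Seven distinct integers between 0 and 6 can only be the values 0 to 6, so the column is fully determined by the table. The sketch below uses Python's `statistics.quantiles` with `method="inclusive"`, which interpolates linearly between order statistics and should correspond to R's default quantile definition; `quartiles` is a hypothetical helper name.

```python
from statistics import quantiles


def quartiles(values):
    """Return (Q1, median, Q3) using inclusive linear interpolation."""
    q1, median, q3 = quantiles(sorted(values), n=4, method="inclusive")
    return q1, median, q3


print(quartiles(range(0, 7)))  # (1.5, 3.0, 4.5)
```

The same call on the integers 1 to 64 returns (16.75, 32.5, 48.25), the figures reported for SkIdTipoContrato in the TipoContrato dataset.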


Descripcion

  • The variable is a key (distinct values for each observation).

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:29:16

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_DIM.Proyecto

The dataset examined has the following dimensions:

Feature Result
Number of observations 89
Number of variables 37

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdProyecto integer 89 0.00 % ×
Codigo.Proyecto integer 89 0.00 %
Nombre.Proyecto character 89 0.00 % ×
Clase.Proyecto character 4 0.00 % ×
Tipo character 2 0.00 % ×
Estado character 3 0.00 %
Presupuesto.Fijo character 2 0.00 %
Propietario character 4 0.00 % ×
Sucursal integer 89 0.00 %
Sucursal.Nombre character 89 0.00 % ×
MacroProyecto integer 11 52.81 % ×
MacroProyecto.Descripcion character 11 0.00 % ×
Centro.Costo integer 74 0.00 % ×
Centro.Costo.Descripcion character 73 0.00 % ×
VIS character 2 0.00 %
Sucursal.Administrativa character 1 0.00 % ×
SkIdEmpresa integer 1 0.00 % ×
Empresa character 1 0.00 % ×
Fecha.De.Elaboracion character 59 0.00 % ×
Fecha.De.Inicio character 61 0.00 % ×
Fecha.De.Finalizacion character 61 0.00 % ×
UnidadAConstruir character 4 0.00 % ×
CantidadAConstruir numeric 12 0.00 % ×
AreaAConstruir_M2 numeric 23 0.00 % ×
AreaConstruidaFinal_M2 numeric 13 1.12 % ×
AreaAVender_M2 numeric 4 0.00 % ×
Arealote_M2 numeric 5 1.12 % ×
CostoPreFactibilidad numeric 2 2.25 % ×
Iniciales logical 1 100.00 % ×
Nocontrato logical 1 100.00 % ×
Cliente character 12 0.00 % ×
Inversionista character 15 0.00 % ×
Almacenista character 14 0.00 % ×
PorcentajeAdministracion numeric 3 0.00 % ×
PorcentajeImprevistos numeric 3 0.00 % ×
PorcentajeUtilidad numeric 3 0.00 % ×
IVA numeric 3 0.00 % ×

Variable list

SkIdProyecto

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 89
Median 100204
1st and 3rd quartiles 100128; 100255
Min. and max. 1003; 100295

  • Note that the following possible outlier values were detected: "1003", "1005", "1006", "1009", "10011", …, "10028", "10029", "10030", "10031", "10035" (4 values omitted).
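
One possible reading of these outliers, offered as an assumption rather than something the report confirms: SkIdProyecto looks like the digits of SkIdEmpresa (100 in this dataset) followed by the digits of Codigo.Proyecto. The reported statistics line up with that reading (minimum 1003 against 3, median 100204 against 204, maximum 100295 against 295). Under that assumption, short project codes produce keys that are orders of magnitude smaller than the rest, which a numeric outlier check then flags. `concat_key` is a hypothetical helper written only to illustrate the pattern.

```python
def concat_key(empresa, codigo):
    """Build a surrogate key by joining the digits of two identifiers (assumed scheme)."""
    return int(f"{empresa}{codigo}")


# Values taken from the statistics reported for this dataset.
for codigo in (3, 204, 295):
    print(codigo, "->", concat_key(100, codigo))
# 3 -> 1003
# 204 -> 100204
# 295 -> 100295
```

If the scheme is confirmed, the variable is an identifier and the outlier message can be ignored. Storing it as character would keep numeric checks from being applied to it.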

Codigo.Proyecto

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 89
Median 204
1st and 3rd quartiles 128; 255
Min. and max. 3; 295


Nombre.Proyecto

  • The variable is a key (distinct values for each observation).

Clase.Proyecto

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 4
Mode “Admon Delegada sin Representacion”

  • Note that the following levels have at most five observations: "SIN ASIGNAR CLASE PROYECTO".
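
The "at most five observations" message is a frequency count per level. A minimal sketch of the idea in Python follows; `rare_levels` is a hypothetical helper and the sample column is invented for illustration, not taken from the data.

```python
from collections import Counter


def rare_levels(values, max_count=5):
    """Return the levels that occur at most `max_count` times, sorted alphabetically."""
    counts = Counter(values)
    return sorted(level for level, n in counts.items() if n <= max_count)


# Invented sample: one frequent level and one rare level.
sample = ["Admon Delegada sin Representacion"] * 8 + ["SIN ASIGNAR CLASE PROYECTO"] * 2
print(rare_levels(sample))  # ['SIN ASIGNAR CLASE PROYECTO']
```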

Tipo

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “ADPRO”

  • Note that the following levels have at most five observations: "ADPRO-CBR".

Estado

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 3
Mode “Finalizado”


Presupuesto.Fijo

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “NO”


Propietario

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 4
Mode “SIN PROPIETARIO”

  • Note that the following levels have at most five observations: "CAJA COLOMBIANA DE SUBSIDIO FAMILIAR COLSUBSIDIO", "PONTIFICIA UNIVERSIDAD JAVERIANA", "PROMOTORA SOLUCIONES DE VIVIENDA SAS".

Sucursal

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 89
Median 204
1st and 3rd quartiles 128; 255
Min. and max. 3; 295


Sucursal.Nombre

  • The variable is a key (distinct values for each observation).

MacroProyecto

Feature Result
Variable type integer
Number of missing obs. 47 (52.81 %)
Number of unique values 10
Median 14
1st and 3rd quartiles 9.25; 103
Min. and max. 1; 107

  • Note that the following possible outlier values were detected: "1", "3".
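
The summary table lists 11 unique values for MacroProyecto, while the variable table above lists 10. The gap is consistent with the summary counting the missing value as one extra level, although the report does not state this. A small Python sketch of the two ways of counting, using the hypothetical helper `unique_counts` and invented sample values:

```python
def unique_counts(values):
    """Return (distinct values counting None as a value, distinct non-missing values)."""
    with_missing = len(set(values))
    without_missing = len({v for v in values if v is not None})
    return with_missing, without_missing


print(unique_counts([1, 3, 14, None, None]))  # (4, 3)
```

The same one-unit difference appears for other variables with missing observations in this report, for example Contacto and CIIU.Cod in the Tercero dataset.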

MacroProyecto.Descripcion

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 11
Mode “”

  • The following suspected missing value codes enter as regular values: "".

  • Note that the following levels have at most five observations: "Caminos de Sie - Manzana 2", "Caminos de Sie - Manzana 4", "Centro Cultural Atrio", "Hotel Four Seasons San Francisco", "Quadro Smart Living", "Valverde Ciprés", "Valverde Roble".


Centro.Costo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 74
Median 2308307
1st and 3rd quartiles 2304409; 2312414
Min. and max. 2200101; 10388801

  • Note that the following possible outlier values were detected: "2200101", "2200102", "2209601", "2209602", "2211001", "2211401", "2214201", "10388801".

Centro.Costo.Descripcion

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 73
Mode “Edificaciones Manz 1”

  • The following values appear with prefixed or suffixed white space: "Cipres Directos ", "Cipres Locales NO Vis ", "Costos compartidos Dts Arboleda ", "Directos San Francisco ", "Dosel del Bosque Piscilago ", …, "Plataforma Arboleda Vis ", "Roble Directos ", "Roble Locales NO Vis ", "Urbanismo Externo Arboleda Vis ", "Vive 92 NQS Directos " (4 values omitted).

  • Note that the following levels have at most five observations: "Acabados Fase I CC UAandes", "Centro Civico Univ.Andes", "Cipres Directos ", "Cipres Locales NO Vis ", "Costos compartidos Dts Arboleda ", …, "Urban Interno Manz 4", "Urban P.Principal", "Urbanismo Externo Arboleda Vis ", "Vive 92 NQS Directos ", "VV Nogal Directos" (61 values omitted).
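
Values such as "Cipres Directos " differ from their trimmed form only by a trailing space, so they count as separate levels. A minimal Python sketch for locating and trimming them follows; the helper names are hypothetical and the sample reuses three values listed above.

```python
def has_outer_whitespace(value):
    """True when a string starts or ends with whitespace."""
    return isinstance(value, str) and value != value.strip()


def strip_column(values):
    """Trim leading and trailing whitespace from every string, leaving other types as-is."""
    return [v.strip() if isinstance(v, str) else v for v in values]


sample = ["Cipres Directos ", "Roble Directos ", "VV Nogal Directos"]
print([v for v in sample if has_outer_whitespace(v)])
print(strip_column(sample))
```

Trimming before counting levels may also reduce the number of levels reported with at most five observations, since padded and unpadded spellings of the same description would merge.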


VIS

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “NO”


Sucursal.Administrativa

  • The variable only takes one (non-missing) value: "Principal". The variable contains 0 % missing observations.

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

Empresa

  • The variable only takes one (non-missing) value: "ARPRO ARQUITECTOS INGENIEROS S.A.S". The variable contains 0 % missing observations.

Fecha.De.Elaboracion

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 59
Mode “03/02/2015”

  • Note that the following levels have at most five observations: "01/06/2017", "01/10/2024", "01/11/2011", "02/02/2024", "02/04/2014", …, "31/01/2012", "31/05/2023", "31/07/2021", "31/10/2022", "31/10/2024" (48 values omitted).

Fecha.De.Inicio

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 61
Mode “03/10/2022”

  • Note that the following levels have at most five observations: "01/01/2021", "01/02/2020", "01/02/2025", "01/05/2019", "01/06/2021", …, "29/04/2014", "30/08/2011", "31/01/2012", "31/05/2023", "31/10/2022" (49 values omitted).

Fecha.De.Finalizacion

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 61
Mode “03/02/2017”

  • Note that the following levels have at most five observations: "01/11/2014", "02/04/2016", "02/07/2019", "03/02/2017", "06/09/2025", …, "31/12/2024", "31/12/2025", "31/12/2026", "31/12/2027", "31/12/2031" (51 values omitted).
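
Fecha.De.Elaboracion, Fecha.De.Inicio and Fecha.De.Finalizacion are stored as character, so they are summarised by level and no date-specific checks are applied to them. Listed values such as "31/12/2024" indicate a day/month/year layout, because a first field of 31 cannot be a month. A minimal Python sketch of parsing that layout follows; `parse_fecha` is a hypothetical helper, and returning None for unparseable text is a design choice made for this illustration.

```python
from datetime import date, datetime


def parse_fecha(text):
    """Parse a 'dd/mm/yyyy' string into a date; return None when it does not parse."""
    if text is None:
        return None
    try:
        return datetime.strptime(text.strip(), "%d/%m/%Y").date()
    except ValueError:
        return None


print(parse_fecha("31/12/2024"))  # 2024-12-31
print(parse_fecha(""))            # None
```

Once parsed, start and end dates can be compared directly, for instance to confirm that each project ends on or after the day it starts.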

UnidadAConstruir

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 4
Mode “m2”

  • The following suspected missing value codes enter as regular values: "".

  • Note that the following levels have at most five observations: "M2", "un".

  • Note that there might be case problems with the following levels: "m2", "M2".
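
The case message can be reproduced by grouping levels that become identical once letter case is ignored. A minimal Python sketch follows; `case_variants` is a hypothetical helper, and the sample uses the levels named above plus the empty string.

```python
from collections import defaultdict


def case_variants(values):
    """Map each case-folded level to its spellings, keeping only levels with several."""
    groups = defaultdict(set)
    for v in values:
        groups[v.casefold()].add(v)
    return {key: sorted(spellings) for key, spellings in groups.items() if len(spellings) > 1}


print(case_variants(["m2", "M2", "un", ""]))  # {'m2': ['M2', 'm2']}
```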


CantidadAConstruir

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 12
Median 1
1st and 3rd quartiles 0; 1
Min. and max. 0; 540

  • Note that the following possible outlier values were detected: "23", "88", "174", "184", "300", "312", "396", "456", "457", "540".
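
With a first quartile of 0 and a third quartile of 1, almost any larger quantity falls far outside the bulk of the data. The sketch below applies the plain Tukey rule (1.5 times the interquartile range beyond the quartiles). This is a stand-in chosen for illustration: the rule dataMaid applies may differ, for example by adjusting for skewness. `tukey_outliers` is a hypothetical helper and the sample values are invented.

```python
from statistics import quantiles


def tukey_outliers(values, k=1.5):
    """Return the values lying more than k * IQR below Q1 or above Q3."""
    data = sorted(v for v in values if v is not None)
    q1, _, q3 = quantiles(data, n=4, method="inclusive")
    spread = q3 - q1
    lower, upper = q1 - k * spread, q3 + k * spread
    return [v for v in data if v < lower or v > upper]


print(tukey_outliers([0, 0, 1, 1, 1, 1, 23, 540]))  # [23, 540]
```

For a column dominated by 0 and 1, flagged values may simply be the few projects that record a real quantity, so they deserve a look before being treated as errors.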

AreaAConstruir_M2

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 23
Median 1
1st and 3rd quartiles 0; 1
Min. and max. 0; 40996

  • Note that the following possible outlier values were detected: "4700.8", "4953.6", "5021.72", "7238.54", "8013.46", …, "25473.99", "25492.3", "26832", "34670.6", "40996" (11 values omitted).

AreaConstruidaFinal_M2

Feature Result
Variable type numeric
Number of missing obs. 1 (1.12 %)
Number of unique values 12
Median 0
1st and 3rd quartiles 0; 1
Min. and max. 0; 21904.02

  • Note that the following possible outlier values were detected: "4700.8", "4953.6", "7238.54", "10066.39", "10184", "13112.44", "19513.53", "19988", "21464.19", "21904.02".

AreaAVender_M2

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 4
Mode “0”
Reference category 0

  • Note that the following levels have at most five observations: "2479.11", "4556.41".
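
Numeric variables with very few distinct values are summarised here with a mode and a reference category instead of quartiles. The sketch below illustrates that switch in Python. The cutoff of fewer than five distinct non-missing values is an assumption that fits the variables in this report (4 distinct values are treated as a factor, 12 are not); the exact rule should be checked in the dataMaid documentation. `treat_as_factor` is a hypothetical helper.

```python
def treat_as_factor(values, max_unique=5):
    """True when a numeric column has fewer than `max_unique` distinct non-missing values."""
    distinct = {v for v in values if v is not None}
    return len(distinct) < max_unique


# Invented columns with 4 and 12 distinct values respectively.
print(treat_as_factor([0, 0, 0, 1, 2479.11, 4556.41]))  # True
print(treat_as_factor(list(range(12))))                 # False
```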

Arealote_M2

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type numeric
Number of missing obs. 1 (1.12 %)
Number of unique values 4
Mode “0”
Reference category 0

  • Note that the following levels have at most five observations: "1260", "4347.27", "5276.1".

CostoPreFactibilidad

  • The variable only takes one (non-missing) value: "0". The variable contains 2.25 % missing observations.

Iniciales

  • The variable only takes one value: "NA".

Nocontrato

  • The variable only takes one value: "NA".

Cliente

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 12
Mode “”

  • The following suspected missing value codes enter as regular values: "".

  • Note that the following levels have at most five observations: "Arpro", "ARPRO ARQUITECTOS INGENIERSO S.A.", "ARPRO INGENIEROS ARQUITECTOS S.A.", "Caja de Compensación Colsubsidio", "CANPACK COLOMBIA SAS", "INVERSIONES FAMOSO", "Pontificia Universidad Javieriana", "QBO", "Solution Investment S.A.S.", "Universidad de los Andes".


Inversionista

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 15
Mode “”

  • The following suspected missing value codes enter as regular values: "".

  • The following values appear with prefixed or suffixed white space: "ARPRO ", "ARPRO Arquitectos Ingenieros S.A. ".

  • Note that the following levels have at most five observations: "Arpro", "ARPRO ", "Arpro Arquitecto Ingenieros S.A.", "ARPRO Arquitectos Ingenieros S.A. ", "Chaid Neme", …, "INVERSIONES ARPRO PROVI SAS", "Prominsa", "PROMINSA LTDA", "SOMEC", "Uniandes" (2 values omitted).

  • Note that there might be case problems with the following levels: "Arpro Arquitectos Ingenieros S.A.", "ARPRO Arquitectos Ingenieros S.A.".


Almacenista

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 14
Mode “”

  • The following suspected missing value codes enter as regular values: "".

  • Note that the following levels have at most five observations: "Andrea Pedraza", "Carlos Medina", "Fredy Ortiz", "Isidro González", "Juan Carlos Chaparro", …, "Orlando Camacho", "Oscar Fernando Pulido", "Oscar Marca", "Rafael Rodriguez", "Rafael Rodríguez" (1 values omitted).
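
The list above contains both "Rafael Rodriguez" and "Rafael Rodríguez", which differ only by an accent and would not be caught by a case check. Whether they refer to the same person is not confirmed by the report. A minimal Python sketch for spotting such pairs removes combining marks before comparing; `fold_accents` is a hypothetical helper.

```python
import unicodedata


def fold_accents(text):
    """Return a comparison key with accents removed and letter case ignored."""
    decomposed = unicodedata.normalize("NFKD", text)
    without_marks = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return without_marks.casefold()


print(fold_accents("Rafael Rodríguez") == fold_accents("Rafael Rodriguez"))  # True
```

The key is meant for matching only. The original spelling should be kept for display once the preferred form is agreed.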


PorcentajeAdministracion

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 3
Mode “0”
Reference category 0

  • Note that the following levels have at most five observations: "12", "18".

PorcentajeImprevistos

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 3
Mode “0”
Reference category 0

  • Note that the following levels have at most five observations: "3", "4".

PorcentajeUtilidad

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 3
Mode “0”
Reference category 0

  • Note that the following levels have at most five observations: "4", "4.5".

IVA

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 3
Mode “16”
Reference category 0

  • Note that the following levels have at most five observations: "19".

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:29:18

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_DIM.Tercero

The dataset examined has the following dimensions:

Feature Result
Number of observations 12709
Number of variables 19

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdTercero integer 12709 0.00 %
Nombre character 12673 0.00 % ×
Nit character 12709 0.00 % ×
Contacto character 3471 1.10 % ×
Email character 4235 0.00 % ×
Direccion character 11858 0.00 % ×
Telefono character 10836 0.00 % ×
Tipo character 4 0.00 % ×
Plazo.de.pago numeric 11 0.01 % ×
Ciudad character 146 0.00 % ×
CIIU.Cod integer 497 37.07 % ×
CIIU character 496 0.00 % ×
Estado character 3 0.00 % ×
Especialidad logical 1 100.00 % ×
Categoria logical 1 100.00 % ×
Grupo logical 1 100.00 % ×
Calificacion logical 1 100.00 % ×
Cargo character 851 0.00 % ×
Naturaleza character 3 0.00 % ×

Variable list

SkIdTercero

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 12709
Median 6354
1st and 3rd quartiles 3177; 9531
Min. and max. -1; 12708


Nombre

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 12673
Mode “360 GRADOS AGENCIA CREATIVA SAS”

  • The following values appear with prefixed or suffixed white space: " AIRALO INC", " CANTATA SA", " CVS PHARMACY", " TRADER JOES", " WHOLE FOODS MARKET INC", …, "ZABALETA BLADIMIR ", "ZERDA ESGUERRA DIEGO LIBARDO ", "ZORRO GUTIERREZ MODESTO ", "ZULUAGA BOTERO JORGE MAURICIO ", "ZULUAGA DUQUE RAMON EUSEBIO " (1310 values omitted).

  • Note that the following levels have at most five observations: " AIRALO INC", " CANTATA SA", " CVS PHARMACY", " TRADER JOES", " WHOLE FOODS MARKET INC", …, "ZUÑIGA SANCHEZ ALEJANDRA MARIA", "ZURICH COLOMBIA SEGUROS S.A.", "ZURICH COLOMBIA SEGUROS SA", "ZURITA GUTIERREZ ALFONSO RAFAEL", "ZYCOL LTDA" (12663 values omitted).


Nit

  • The variable is a key (distinct values for each observation).

Contacto

Feature Result
Variable type character
Number of missing obs. 140 (1.1 %)
Number of unique values 3470
Mode “”

  • The following suspected missing value codes enter as regular values: "".

  • The following values appear with prefixed or suffixed white space: " CAROLINA GUALTERO URREGO", " CASTRO ANA MARIA", " FRANCIA ACOSTA MARTÍNEZ", " GINA PAOLA CUBILLOS PULIDO", "4311-4312-4923 ", …, "YEINS SMITH ", "YENNIFER ZAPATA ALZATE ", "YHEEFRY ENRIQUEZ SUAREZ ", "YOLANDA SANTAMARÍA ARDILA ", "ZARATE JORGE " (574 values omitted).

  • Note that the following levels have at most five observations: " CAROLINA GUALTERO URREGO", " CASTRO ANA MARIA", " FRANCIA ACOSTA MARTÍNEZ", " GINA PAOLA CUBILLOS PULIDO", ",ARIA CLEMENCIA GOMEZ PRIETO", …, "ZARATE PARRA ORLANDO", "ZENELIA GIRALDO", "ZULEIMA MARTINEZ", "ZULEYMY MENDEZ", "ZULUAGA REINA JULIANA" (3453 values omitted).

  • Note that there might be case problems with the following levels: "claudia carolina castrillo galvis", "CLAUDIA CAROLINA CASTRILLO GALVIS", "Juan Camilo Martin Herrera", "JUAN CAMILO MARTIN HERRERA".


Email

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 4235
Mode “”


Direccion

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 11858
Mode “0”

  • The following suspected missing value codes enter as regular values: "".

  • The following values appear with prefixed or suffixed white space: " AD ", " AER PALMIRA", " CL DEL CHORRITO ", " CONDOMINIO ALEJANDRIA AP L-302", " CONJUNTO TORRES DEL", …, "VRDA BOJACA ", "VRE 1 1 1 ", "VTE TOCAIMA EN TERMINAL ", "WORD TRADE CENTER ", "ZARAGOCILLA CALLE EL PORVENIR N. 49-80 " (2231 values omitted).

  • Note that the following levels have at most five observations: "", " AD ", " AER PALMIRA", " CL DEL CHORRITO ", " CONDOMINIO ALEJANDRIA AP L-302", …, "ZN 249 USA", "ZN 767 SUIZA", "ZN 767 Zurich", "ZN TX 77056 1360", "ZN ZF PERMANENTE DEL CAUCA ET1 LOTE 4" (11823 values omitted).

  • Note that there might be case problems with the following levels: "calle 98 8 28 of602", "CALLE 98 8 28 OF602", "CL 84 28b 95", "CL 84 28B 95", "CL 98 8 28 of 602", …, "CONDOMINIO CHORLAVI CS 1 SECTOR EL TIGRE", "CR 13a 90 21 OF 203", "CR 13A 90 21 OF 203", "cra 12-79-50", "CRA 12-79-50" (4 values omitted).


Telefono

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 10836
Mode “6010404”

  • The following suspected missing value codes enter as regular values: "", " ", ".".

  • The following values appear with prefixed or suffixed white space: " ", " 6718605", " 111", " 2465662 - 2171887", " 5311155 ", …, "9071246 ", "9098650 ", "9236046 ", "9260995 ", "CEL 311 - 5730404 " (403 values omitted).

  • Note that the following levels have at most five observations: " ", " 6718605", " 111", " 2465662 - 2171887", " 5311155 ", …, "981179299", "981396252", "981814873", "CEL 311 - 5730404 ", "NO HAY" (10816 values omitted).
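
Telefono reports 0 % missing observations even though "", " " and "." are flagged as suspected missing value codes, so the true share of missing phone numbers is understated. A minimal Python sketch for recoding such placeholders follows; `normalize_missing` is a hypothetical helper, and its default code list covers only the values flagged above. Other placeholders, such as the "NO HAY" level listed above, would need to be added deliberately after review.

```python
def normalize_missing(value, codes=("", ".")):
    """Trim a string and return None when the trimmed text is a known missing code."""
    if value is None:
        return None
    trimmed = value.strip()
    return None if trimmed in codes else trimmed


for raw in ["", " ", ".", " 6718605", "NO HAY"]:
    print(repr(raw), "->", repr(normalize_missing(raw)))
```

Because whitespace is trimmed first, a single space is recoded as missing without being listed separately.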


Tipo

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 4
Mode “P”

  • The following suspected missing value codes enter as regular values: "".

  • Note that the following levels have at most five observations: "".


Plazo.de.pago

Feature Result
Variable type numeric
Number of missing obs. 1 (0.01 %)
Number of unique values 10
Median 1
1st and 3rd quartiles 1; 30
Min. and max. 0; 90

  • Note that the following possible outlier values were detected: "0".

Ciudad

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 146
Mode “BOGOTÁ D.C.”

  • The following suspected missing value codes enter as regular values: "".

  • Note that the following levels have at most five observations: "", "AGUA DE DIOS", "AGUACHICA", "AIPE", "ANAPOIMA", …, "YOPAL", "ZETAQUIRA", "ZIPACON", "ZOETERMEER", "ZONA BANANERA" (93 values omitted).


CIIU.Cod

Feature Result
Variable type integer
Number of missing obs. 4711 (37.07 %)
Number of unique values 496
Median 7730
1st and 3rd quartiles 4663; 9900155
Min. and max. 10; 990074221

  • Note that the following possible outlier values were detected: "990041002", "990045111", "990045121", "990045211", "990045212", …, "990074141", "990074142", "990074211", "990074212", "990074221" (11 values omitted).

CIIU

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 496
Mode “”

  • The following suspected missing value codes enter as regular values: "".

  • Note that the following levels have at most five observations: "Acabado de productos textiles.", "Actividades combinadas de apoyo a instalaciones.", "Actividades de administración de fondos.", "Actividades de aeropuertos, servicios de navegación aérea y demás actividades conexas al transporte aéreo.", "Actividades de agentes y corredores de seguros", …, "Transporte urbano colectivo regular de pasajeros", "Tratamiento y disposición de desechos no peligrosos.", "Tratamiento y disposición de desechos peligrosos.", "Tratamiento y revestimiento de metales; mecanizado.", "Tratamiento y revestimiento de metales; trabajos de ingeniería mecánica en general realizados a cambio de una retribución o por contrata" (303 values omitted).


Estado

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 3
Mode “Activo”

  • The following suspected missing value codes enter as regular values: "".

  • Note that the following levels have at most five observations: "".


Especialidad

  • The variable only takes one value: "NA".

Categoria

  • The variable only takes one value: "NA".

Grupo

  • The variable only takes one value: "NA".

Calificacion

  • The variable only takes one value: "NA".

Cargo

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 851
Mode “”

  • The following suspected missing value codes enter as regular values: "".

  • The following values appear with prefixed or suffixed white space: " CARTERA", "ABOGADA ", "ADMINISTRACION ", "ADMINISTRADOR ", "ADMINISTRADOR DE PROYECTOS ", …, "TESORERA ", "TESORERIA ", "TESORERO ", "VENDEDOR ", "VENTAS CONSTRUCTOR " (205 values omitted).

  • Note that the following levels have at most five observations: " CARTERA", "3144578699", "6720287", "ABOGADA ", "ABOGADO", …, "VENDEDOR / GERENTE", "VENDEDORA", "VENTAS", "VENTAS CONSTRUCTOR ", "Vicepresidente de Carteras colectivas" (762 values omitted).

  • Note that there might be case problems with the following levels: "Administrador", "ADMINISTRADOR", "Administradora ", "ADMINISTRADORA ", "analista contabilidad", …, "REPRESENTANTE LEGAL", "Tesoreria", "TESORERIA", "Vendedor", "VENDEDOR" (56 values omitted).


Naturaleza

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 3
Mode “J”

  • The following suspected missing value codes enter as regular values: "".

  • Note that the following levels have at most five observations: "".


Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:29:31

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_DIM.TipoContrato

The dataset examined has the following dimensions:

Feature Result
Number of observations 64
Number of variables 5

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdTipoContrato integer 64 0.00 %
SkIdEmpresa integer 1 0.00 % ×
Tipo.Codigo character 64 0.00 % ×
Tipo.Descripcion character 64 0.00 % ×
Empresa character 1 0.00 % ×

Variable list

SkIdTipoContrato

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 64
Median 32.5
1st and 3rd quartiles 16.75; 48.25
Min. and max. 1; 64


SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

Tipo.Codigo

  • The variable is a key (distinct values for each observation).

Tipo.Descripcion

  • The variable is a key (distinct values for each observation).

Empresa

  • The variable only takes one (non-missing) value: "ARPRO ARQUITECTOS INGENIEROS S.A.S". The variable contains 0 % missing observations.

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:30:20

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_DIM.TipoDeDescuento

The dataset examined has the following dimensions:

Feature Result
Number of observations 3
Number of variables 2

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdTipoDescuento integer 3 0.00 % ×
Descripcion character 3 0.00 % ×

Variable list

SkIdTipoDescuento

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 3
Mode “1”
Reference category 1

  • Note that the following levels have at most five observations: "1", "2", "3".

Descripcion

  • The variable is a key (distinct values for each observation).

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:30:21

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_DIM.TiposPolizas

The dataset examined has the following dimensions:

Feature Result
Number of observations 11
Number of variables 3

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdTipoPoliza integer 11 0.00 %
SkIdEmpresa integer 1 0.00 % ×
Descripcion character 11 0.00 % ×

Variable list

SkIdTipoPoliza

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 11
Median 1006
1st and 3rd quartiles 1003.5; 10036.5
Min. and max. 1001; 10039


SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

Descripcion

  • The variable is a key (distinct values for each observation).
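A key check of this kind can be sketched in a few lines. The Python snippet below is an illustration only, not dataMaid's implementation; `is_key` and the sample values are hypothetical names and data chosen for the example.

```python
def is_key(values):
    """Return True when every observation carries a distinct value."""
    values = list(values)
    return len(set(values)) == len(values)


# Hypothetical descriptions, not taken from the dataset.
sample = ["Cumplimiento", "Anticipo", "Estabilidad"]

print(is_key(sample))                 # True: three rows, three distinct values
print(is_key(sample + ["Anticipo"]))  # False: "Anticipo" now appears twice
```

For this table the same condition holds because 11 observations yield 11 unique values of Descripcion.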

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:30:23

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_DIM.Usuario

The dataset examined has the following dimensions:

Feature Result
Number of observations 421
Number of variables 7

Checks performed

The following variable checks were performed, depending on the data type of each variable:

  character factor labelled haven labelled numeric integer logical Date
Identify miscoded missing values × × × × × × ×
Identify prefixed and suffixed whitespace × × × ×
Identify levels with < 6 obs. × × × ×
Identify case issues × × × ×
Identify misclassified numeric or integer variables × × × ×
Identify outliers × × ×

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdUsuario integer 421 0.00 % ×
SkIdEmpresa integer 2 0.00 % ×
Nombre character 421 0.00 % ×
Cargo character 177 0.00 % ×
Nivel.Acceso character 54 0.00 % ×
Estado character 3 0.00 % ×
Empresa character 2 0.00 % ×

Variable list

SkIdUsuario

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 421
Median 100304
1st and 3rd quartiles 100199; 100409
Min. and max. 0; 100514

  • Note that the following possible outlier values were detected: "0", "10013", "10043", "10044", "10045", "10048", "10050".
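To see why these identifiers are flagged, the reported quartiles can be run through a standard Tukey boxplot rule. This is a sketch under the assumption that a 1.5 × IQR fence is a reasonable stand-in; dataMaid's own outlier rule may differ (for example in how it treats skewed data), and `tukey_fences` is a hypothetical helper.

```python
def tukey_fences(q1, q3, k=1.5):
    """Lower and upper fences of the classic boxplot rule."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# Quartiles, maximum and flagged values as printed in the report above.
low, high = tukey_fences(100199, 100409)
flagged = [0, 10013, 10043, 10044, 10045, 10048, 10050]

print(low, high)                       # 99884.0 100724.0
print(all(v < low for v in flagged))   # True: every flagged value is below the lower fence
print(low <= 100514 <= high)           # True: the maximum is inside the fences
```

Because SkIdUsuario is a surrogate key, such flags may reflect the numbering scheme of the identifiers rather than data errors.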

SkIdEmpresa

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “100”
Reference category 0

  • Note that the following levels have at most five observations: "0".

Nombre

  • The variable is a key (distinct values for each observation).

Cargo

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 177
Mode “Residente de Obra”

  • The following suspected missing value codes enter as regular values: "".

  • The following values appear with prefixed or suffixed white space: "Administrador de Obra ", "Auxiliar de compras No1 ", "Director Obra ", "Gerente ", "Gerente de Proyectos ", "Residente de Obra ".

  • Note that the following levels have at most five observations: "Admin1", "Administrador", "Administrador de Obra ", "Administrador de proyecto", "Administrador Obra Engativa", …, "Residente Provisional de Obra", "Revisor Fiscal", "Seguridad Informatica", "SIN CARGO", "Supervisor" (155 values omitted).

  • Note that there might be case problems with the following levels: "Asistente contable", "Asistente Contable", "Auxiliar Control de costos", "Auxiliar Control de Costos", "Director de compras", …, "Gerente de Proyecto", "interventor", "Interventor", "n/a", "N/A" (2 values omitted).
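The whitespace and case findings for Cargo can be reproduced with two small checks. The sketch below is in Python and only illustrates the idea; the function names are hypothetical, and the input is limited to levels quoted in the report above.

```python
from collections import defaultdict


def whitespace_issues(levels):
    """Levels that change when leading/trailing whitespace is stripped."""
    return sorted(v for v in set(levels) if v != v.strip())


def case_issues(levels):
    """Groups of levels that differ only by letter case."""
    groups = defaultdict(set)
    for v in levels:
        groups[v.lower()].add(v)
    return sorted(sorted(g) for g in groups.values() if len(g) > 1)


cargo = [
    "Administrador de Obra ", "Gerente ", "Residente de Obra",
    "Asistente contable", "Asistente Contable",
    "interventor", "Interventor", "n/a", "N/A",
]

print(whitespace_issues(cargo))
# ['Administrador de Obra ', 'Gerente ']
print(case_issues(cargo))
# [['Asistente Contable', 'Asistente contable'], ['Interventor', 'interventor'], ['N/A', 'n/a']]
```

Stripping whitespace and agreeing on one casing before counting levels would likely reduce the 177 unique values, although by how much cannot be read off the report.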


Nivel.Acceso

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 54
Mode “Interventor”

  • The following values appear with prefixed or suffixed white space: "Sandra Mireya ".

  • Note that the following levels have at most five observations: "Acceso Total", "Administrador", "Analista", "Analista Financiero", "Andrea Rada", …, "Revisoría Fiscal", "Sandra Leon", "Seguridad Informatica NH", "SIN NIVEL", "Soporte Arpro" (26 values omitted).


Estado

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 3
Mode “ACTIVO”

  • Note that the following levels have at most five observations: "0".

Empresa

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “ARPRO ARQUITECTOS INGENIEROS S.A.S”

  • Note that the following levels have at most five observations: "SIN EMPRESA".
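The "levels with < 6 obs." check amounts to a frequency count with a threshold. The Python sketch below assumes that reading; `rare_levels` is a hypothetical helper and the counts are invented for illustration, since the report does not print per-level frequencies.

```python
from collections import Counter


def rare_levels(values, min_obs=6):
    """Levels observed fewer than `min_obs` times."""
    counts = Counter(values)
    return sorted(level for level, n in counts.items() if n < min_obs)


# Illustrative counts only; the real column has 421 observations.
empresa = ["ARPRO ARQUITECTOS INGENIEROS S.A.S"] * 10 + ["SIN EMPRESA"] * 2

print(rare_levels(empresa))  # ['SIN EMPRESA']
```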

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:30:26

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_DIM.VariablesAdicionalesContratos

The dataset examined has the following dimensions:

Feature Result
Number of observations 17
Number of variables 5

Checks performed

The following variable checks were performed, depending on the data type of each variable:

  character factor labelled haven labelled numeric integer logical Date
Identify miscoded missing values × × × × × × ×
Identify prefixed and suffixed whitespace × × × ×
Identify levels with < 6 obs. × × × ×
Identify case issues × × × ×
Identify misclassified numeric or integer variables × × × ×
Identify outliers × × ×

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdVariablesAdicionalesContratos integer 10 0.00 %
SkIdEmpresa integer 1 0.00 % ×
Variable.configurada character 2 0.00 %
Respuesta.de.variable character 4 0.00 % ×
Empresa character 1 0.00 % ×

Variable list

SkIdVariablesAdicionalesContratos

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 10
Median 1002010040
1st and 3rd quartiles 1001860330; 1002360046
Min. and max. 1001190044; 1002750087


SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

Variable.configurada

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “Fecha Ultima Propuesta”


Respuesta.de.variable

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 4
Mode “”

  • The following suspected missing value codes enter as regular values: "".

  • Note that the following levels have at most five observations: "Construcción de edificios residenciales (4111), Construcción de edificios no residenciales (4112).", "Diciembre de 2018", "Enero 6 de 2020".
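Empty strings that enter as regular values can be detected and recoded before further analysis. The following Python sketch assumes a small, hypothetical list of suspect codes (`SUSPECT_CODES`); it is not the list dataMaid uses, and the helper names are illustrative.

```python
# Hypothetical set of codes treated as "probably missing".
SUSPECT_CODES = {"", "NA", "N/A", "n/a", "-"}


def suspected_missing_codes(values, codes=SUSPECT_CODES):
    """Suspect codes that occur as ordinary values (after trimming)."""
    return sorted({v for v in values if v.strip() in codes})


def recode_missing(values, codes=SUSPECT_CODES):
    """Replace suspect codes with None so they count as missing."""
    return [None if v.strip() in codes else v for v in values]


# Values quoted in the report for Respuesta.de.variable (long level omitted).
respuesta = ["", "Diciembre de 2018", "", "Enero 6 de 2020"]

print(suspected_missing_codes(respuesta))  # ['']
print(recode_missing(respuesta))
# [None, 'Diciembre de 2018', None, 'Enero 6 de 2020']
```

Since the reported mode of this variable is the empty string, recoding it as missing would change the 0 % missing figure shown above.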


Empresa

  • The variable only takes one (non-missing) value: "ARPRO ARQUITECTOS INGENIEROS S.A.S". The variable contains 0 % missing observations.

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:30:29

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_DIM.VariablesAdicionalesProyecto

The dataset examined has the following dimensions:

Feature Result
Number of observations 0
Number of variables 5

Checks performed

The following variable checks were performed, depending on the data type of each variable:

  character factor labelled haven labelled numeric integer logical Date
Identify miscoded missing values × × × × × × ×
Identify prefixed and suffixed whitespace × × × ×
Identify levels with < 6 obs. × × × ×
Identify case issues × × × ×
Identify misclassified numeric or integer variables × × × ×
Identify outliers × × ×

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdVariablesAdicionalesContratos logical 0 NaN % ×
SkIdEmpresa logical 0 NaN % ×
Variable.configurada logical 0 NaN % ×
Respuesta.de.variable logical 0 NaN % ×
Empresa logical 0 NaN % ×

Variable list

SkIdVariablesAdicionalesContratos

  • The variable is a key (distinct values for each observation).

  • The variable only takes one value: "NA".


SkIdEmpresa

  • The variable is a key (distinct values for each observation).

  • The variable only takes one value: "NA".


Variable.configurada

  • The variable is a key (distinct values for each observation).

  • The variable only takes one value: "NA".


Respuesta.de.variable

  • The variable is a key (distinct values for each observation).

  • The variable only takes one value: "NA".


Empresa

  • The variable is a key (distinct values for each observation).

  • The variable only takes one value: "NA".
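The findings for this table are what one would expect from a dataset with 0 observations: a missing percentage of 0/0 is undefined, and "distinct values for each observation" holds vacuously. The Python sketch below illustrates that reading; it is an assumption about why the report looks this way, and both helpers are hypothetical.

```python
import math


def missing_pct(n_missing, n_obs):
    """Percentage of missing observations; NaN when there are no rows."""
    return 100 * n_missing / n_obs if n_obs else math.nan


def is_key(values):
    """True when no value repeats; trivially true for an empty column."""
    values = list(values)
    return len(set(values)) == len(values)


empty_column = []

print(missing_pct(0, len(empty_column)))  # nan
print(is_key(empty_column))               # True
```

A guard that skips tables with zero rows before generating the report would avoid these uninformative entries.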


Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:30:32

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_DIM.ZonasProyecto

The dataset examined has the following dimensions:

Feature Result
Number of observations 1
Number of variables 3

Checks performed

The following variable checks were performed, depending on the data type of each variable:

  character factor labelled haven labelled numeric integer logical Date
Identify miscoded missing values × × × × × × ×
Identify prefixed and suffixed whitespace × × × ×
Identify levels with < 6 obs. × × × ×
Identify case issues × × × ×
Identify misclassified numeric or integer variables × × × ×
Identify outliers × × ×

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdZona integer 1 0.00 % ×
Zona character 1 0.00 % ×
ProyectoDescripcion character 1 0.00 % ×

Variable list

SkIdZona

  • The variable only takes one (non-missing) value: "1002171". The variable contains 0 % missing observations.

Zona

  • The variable is a key (distinct values for each observation).

  • The variable only takes one (non-missing) value: "Zona Valverdes". The variable contains 0 % missing observations.


ProyectoDescripcion

  • The variable is a key (distinct values for each observation).

  • The variable only takes one (non-missing) value: "217 Valverde - Palma". The variable contains 0 % missing observations.


Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:30:34

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_FACT.Acta

The dataset examined has the following dimensions:

Feature Result
Number of observations 199600
Number of variables 27

Checks performed

The following variable checks were performed, depending on the data type of each variable:

  character factor labelled haven labelled numeric integer logical Date
Identify miscoded missing values × × × × × × ×
Identify prefixed and suffixed whitespace × × × ×
Identify levels with < 6 obs. × × × ×
Identify case issues × × × ×
Identify misclassified numeric or integer variables × × × ×
Identify outliers × × ×

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdEmpresa integer 1 0.00 % ×
SkIdProyecto integer 80 0.00 % ×
SkIdFecha integer 4251 0.00 % ×
SkIdEstado integer 7 0.00 % ×
SkIdInsumo integer 5437 0.00 % ×
SkIdItems integer 22778 0.00 % ×
SkIdEspecificacionActas numeric 55896 0.00 % ×
SkIdTercero integer 2521 0.00 % ×
Porcentaje.Anticipo numeric 33 0.00 % ×
Valor.Anticipo numeric 2773 0.00 % ×
Porcentaje.Retencion.Antcipo numeric 56 0.00 %
Valor.Retencion.Anticipo numeric 10166 0.00 % ×
Porcentaje.Retencion.Garantia numeric 12 1.47 %
Valor.Retencion.Garantias numeric 16682 0.00 % ×
Valor.Descuentos numeric 1962 0.00 % ×
Valor.Total.Neto numeric 34442 0.00 % ×
Valor.Iva.Total numeric 23380 0.00 % ×
Valor.Total.Acta numeric 40464 0.00 % ×
Cantidad.Acta numeric 40010 2.90 % ×
Valor.Unitario numeric 50765 2.90 % ×
Valor.Iva.Unitario numeric 30544 2.90 % ×
Valor.Total numeric 121678 2.90 % ×
No.Contrato integer 11407 0.00 % ×
Tipo.Acta character 4 0.00 %
No.Acta integer 385 0.00 % ×
Porcentaje.Retencion.Garantia.Fic numeric 3 0.17 %
Valor.Garantias.Fic numeric 6 0.00 % ×

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdProyecto

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 80
Median 100200
1st and 3rd quartiles 100108; 100228
Min. and max. 1003; 100294

  • Note that the following possible outlier values were detected: "1003", "1005", "1006", "1009", "10011", …, "100280", "100281", "100283", "100288", "100294" (34 values omitted).

SkIdFecha

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 4251
Median 20190516
1st and 3rd quartiles 20160601; 20230329
Min. and max. 20010302; 20251031

  • Note that the following possible outlier values were detected: "20010302".
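The SkIdFecha values are consistent with dates encoded as yyyymmdd integers, although the report itself does not state this. Under that assumption, a short Python sketch can turn the reported minimum, median and maximum into calendar dates; `parse_yyyymmdd` is a hypothetical helper.

```python
from datetime import date, datetime


def parse_yyyymmdd(key):
    """Interpret an integer such as 20190516 as a calendar date."""
    return datetime.strptime(str(key), "%Y%m%d").date()


earliest = parse_yyyymmdd(20010302)  # reported minimum, flagged as outlier
median = parse_yyyymmdd(20190516)    # reported median
latest = parse_yyyymmdd(20251031)    # reported maximum

print(earliest, median, latest)  # 2001-03-02 2019-05-16 2025-10-31
```

Treating the column as a date rather than an integer would make summaries such as quartiles easier to interpret.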

SkIdEstado

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 7
Median 1006200001
1st and 3rd quartiles 1006200001; 1006200001
Min. and max. 10068; 1006200005

  • Note that the following possible outlier values were detected: "10068", "100610", "100611", "1006200002", "1006200003", "1006200005".

SkIdInsumo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 5437
Median 1007763
1st and 3rd quartiles 1002614; 1008952
Min. and max. 1000; 10019219

  • Note that the following possible outlier values were detected: "1000", "100101", "100105", "100106", "100107", …, "10019078", "10019193", "10019194", "10019200", "10019219" (2819 values omitted).

SkIdItems

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 22778
Median 10066896
1st and 3rd quartiles 10027329; 10079491
Min. and max. 1000; 100144964

  • Note that the following possible outlier values were detected: "1000", "100101", "1002462", "1002463", "1002464", …, "100144897", "100144898", "100144899", "100144900", "100144964" (5623 values omitted).

SkIdEspecificacionActas

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 55896
Median 10021121100028
1st and 3rd quartiles 10011111100055; 100118118004729
Min. and max. 1003300011; 1002492490076239

  • Note that the following possible outlier values were detected: "1003300011", "1003300012", "1003300021", "1003300022", "1003300023", …, "1003535034613", "1003535036110", "1003535036111", "1003535036112", "1003535036113" (13398 values omitted).

SkIdTercero

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 2521
Median -1
1st and 3rd quartiles -1; -1
Min. and max. -1; 12702

  • Note that the following possible outlier values were detected: "1", "2", "3", "4", "8", …, "12666", "12671", "12673", "12696", "12702" (2510 values omitted).
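The median and both quartiles of SkIdTercero equal -1, so the interquartile range is 0. Under an IQR-based rule, every value other than -1 is then flagged, which matches the count in the report: 10 listed plus 2510 omitted values give 2520, one fewer than the 2521 unique values. The Python sketch below illustrates this with a Tukey-style fence; the rule and the small sample are assumptions, not dataMaid's exact method or the real data.

```python
def tukey_fences(q1, q3, k=1.5):
    """Lower and upper fences of the classic boxplot rule."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# Reported quartiles: both are -1, so the fences collapse onto -1.
low, high = tukey_fences(-1, -1)

# Illustrative sample mixing the dominant -1 with a few listed IDs.
sample = [-1, -1, -1, -1, 1, 2, 12702]
flagged = [v for v in sample if v < low or v > high]

print(low, high)  # -1.0 -1.0
print(flagged)    # [1, 2, 12702]
```

If -1 is a placeholder for "no third party" (an assumption the report does not confirm), the flagged values are ordinary identifiers rather than outliers.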

Porcentaje.Anticipo

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 33
Median 0
1st and 3rd quartiles 0; 0
Min. and max. 0; 1

  • Note that the following possible outlier values were detected: "0.01", "0.05", "0.08", "0.1", "0.1", …, "0.7", "0.75", "0.83", "0.9", "1" (22 values omitted).

Valor.Anticipo

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 2773
Median 0
1st and 3rd quartiles 0; 0
Min. and max. -17243225600.07; 20736458922.66

  • Note that the following possible outlier values were detected: "-17243225600.07", "-6891809568.03", "-4019898778.85", "-3973737352.88", "-3863640330.58", …, "9348050550", "10167540191", "12263272491.75", "17243225600.07", "20736458922.66" (2762 values omitted).

Porcentaje.Retencion.Antcipo

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 56
Median 0
1st and 3rd quartiles 0; 0.05
Min. and max. 0; 1


Valor.Retencion.Anticipo

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 10166
Median 0
1st and 3rd quartiles 0; 5231052.11
Min. and max. -9e+08; 5911629988.86

  • Note that the following possible outlier values were detected: "-9e+08", "-667080096.9", "-107711486.67", "-105723075", "-46089496", …, "2032554220.15", "2609380076.05", "2910359447.28", "5452596407.93", "5911629988.86" (371 values omitted).

Porcentaje.Retencion.Garantia

Feature Result
Variable type numeric
Number of missing obs. 2926 (1.47 %)
Number of unique values 11
Median 0
1st and 3rd quartiles 0; 0.1
Min. and max. 0; 0.2
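The summary table lists 12 unique values for this variable while the table above lists 11. The same gap of one appears for other variables with missing observations (for example Cantidad.Acta, 40010 versus 40009), which is consistent with the summary table counting the missing value as a level. The Python sketch below illustrates that interpretation; it is an inference from the numbers, and `n_unique` and the sample column are hypothetical.

```python
def n_unique(values, count_missing):
    """Number of distinct values, optionally counting None as one of them."""
    distinct = set(values)
    if not count_missing:
        distinct.discard(None)
    return len(distinct)


# Illustrative column with missing entries represented as None.
column = [0.0, 0.05, 0.1, None, 0.1, None]

print(n_unique(column, count_missing=True))   # 4
print(n_unique(column, count_missing=False))  # 3
```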


Valor.Retencion.Garantias

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 16682
Median 0
1st and 3rd quartiles 0; 4104172.73
Min. and max. -3617808287.34; 3089802283

  • Note that the following possible outlier values were detected: "-3617808287.34", "-2904842346", "-2759153013.53", "-2184048720.85", "-2062780994", …, "923010566.93", "1386473854.1", "1393185382.36", "2184048720.85", "3089802283" (2734 values omitted).

Valor.Descuentos

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 1962
Median 0
1st and 3rd quartiles 0; 0
Min. and max. 0; 515800414.54

  • Note that the following possible outlier values were detected: "2204", "2999", "4350", "4408", "5370.8", …, "290388149.91", "369953352.18", "406048158.79", "450050651.11", "515800414.54" (1951 values omitted).

Valor.Total.Neto

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 34442
Median 14178947.12
1st and 3rd quartiles 1140000; 63202053.38
Min. and max. -2223600323; 9230105669.28

  • Note that the following possible outlier values were detected: "-2223600323", "-1541005315.86", "-1236734210", "-1175830160.27", "-1146903347", …, "6956596103.75", "7199008610.6", "8697933586.85", "9087644610.99", "9230105669.28" (439 values omitted).

Valor.Iva.Total

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 23380
Median 97449.72
1st and 3rd quartiles 0; 515580.04
Min. and max. -422484061.37; 422484061.37

  • Note that the following possible outlier values were detected: "-422484061.37", "-223407730.45", "-186161202", "-95745588.2", "-79818730.26", …, "298355735.08", "309370818.94", "401428889.28", "412494425.25", "422484061.37" (1122 values omitted).

Valor.Total.Acta

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 40464
Median 11900000
1st and 3rd quartiles 1276800; 50299488.74
Min. and max. -17243225600.07; 20736458922.66

  • Note that the following possible outlier values were detected: "-17243225600.07", "-6891809568.03", "-4019898778.85", "-3973737352.88", "-3863640330.58", …, "6595293894", "6891809568.03", "10167540191", "17243225600.07", "20736458922.66" (898 values omitted).
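Several of the listed extremes occur as exact positive and negative pairs, which may indicate entries that were later reversed; the report does not confirm this. The Python sketch below finds such mirrored amounts among the values quoted above. `mirrored_amounts` is a hypothetical helper.

```python
def mirrored_amounts(values):
    """Positive amounts whose exact negative also occurs."""
    present = set(values)
    return sorted(v for v in present if v > 0 and -v in present)


# The ten outlier values printed in the report for Valor.Total.Acta.
listed = [
    -17243225600.07, -6891809568.03, -4019898778.85, -3973737352.88,
    -3863640330.58, 6595293894, 6891809568.03, 10167540191,
    17243225600.07, 20736458922.66,
]

print(mirrored_amounts(listed))  # [6891809568.03, 17243225600.07]
```

Matching each pair on contract and acta number would be needed before treating them as reversals.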

Cantidad.Acta

Feature Result
Variable type numeric
Number of missing obs. 5798 (2.9 %)
Number of unique values 40009
Median 3.24
1st and 3rd quartiles 1; 53
Min. and max. -1236734210; 6793438200

  • Note that the following possible outlier values were detected: "-1236734210", "-853086836", "-407797962", "-393317939.2", "-327637756.3", …, "3785480682.6", "4160496630", "4160496634.6", "5123787390", "6793438200" (12162 values omitted).

Valor.Unitario

Feature Result
Variable type numeric
Number of missing obs. 5798 (2.9 %)
Number of unique values 50764
Median 80000
1st and 3rd quartiles 14288.4; 740000
Min. and max. -13240606.52; 30420118402.79

  • Note that the following possible outlier values were detected: "-13240606.52", "15124846", "15126050", "15126050.5", "15128383", …, "1869220608.09", "3736726701.74", "4103141026", "4959563705.23", "30420118402.79" (2827 values omitted).

Valor.Iva.Unitario

Feature Result
Variable type numeric
Number of missing obs. 5798 (2.9 %)
Number of unique values 30543
Median 150.65
1st and 3rd quartiles 0; 4853.26
Min. and max. 0; 779596794.94

  • Note that the following possible outlier values were detected: "130857.76", "130910", "130960.11", "130972.1", "130983.84", …, "188698247.49", "210046338.38", "215221041.37", "282350244.8", "779596794.94" (4665 values omitted).

Valor.Total

Feature Result
Variable type numeric
Number of missing obs. 5798 (2.9 %)
Number of unique values 121677
Median 1045786.02
1st and 3rd quartiles 168600; 4588964.56
Min. and max. -2168581138.44; 7087702769.07

  • Note that the following possible outlier values were detected: "-2168581138.44", "-1379691004.28", "-1236734210", "-877671962.32", "-853086836", …, "3965669563.09", "4358536269.59", "4358536274.41", "5340134188.76", "7087702769.07" (6280 values omitted).

No.Contrato

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 11407
Median 2000047
1st and 3rd quartiles 1080111; 2280270
Min. and max. 30001; 2940002

  • Note that the following possible outlier values were detected: "2490001", "2490002", "2490003", "2490004", "2490005", …, "2880098", "2880100", "2880102", "2940001", "2940002" (1738 values omitted).

Tipo.Acta

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 4
Mode “ACTAS GRUPOS”


No.Acta

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 385
Median 6
1st and 3rd quartiles 2; 12
Min. and max. 1; 385

  • Note that the following possible outlier values were detected: "60", "61", "62", "63", "64", …, "381", "382", "383", "384", "385" (316 values omitted).

Porcentaje.Retencion.Garantia.Fic

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type numeric
Number of missing obs. 335 (0.17 %)
Number of unique values 2
Mode “0”
Reference category 0


Valor.Garantias.Fic

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 6
Median 0
1st and 3rd quartiles 0; 0
Min. and max. 0; 390399.35

  • Note that the following possible outlier values were detected: "44030", "122200.76", "125100.45", "244402.46", "390399.35".

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:30:47

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_FACT.ActasDescuentos

The dataset examined has the following dimensions:

Feature Result
Number of observations 6808
Number of variables 12

Checks performed

The following variable checks were performed, depending on the data type of each variable:

  character factor labelled haven labelled numeric integer logical Date
Identify miscoded missing values × × × × × × ×
Identify prefixed and suffixed whitespace × × × ×
Identify levels with < 6 obs. × × × ×
Identify case issues × × × ×
Identify misclassified numeric or integer variables × × × ×
Identify outliers × × ×

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdEmpresa integer 1 0.00 % ×
SkIdProyecto integer 67 0.00 % ×
SkIdTercero integer 328 0.00 %
SkIdEspecificacionDeContratos numeric 982 0.00 % ×
SkIdEspecificacionActas numeric 2238 0.00 % ×
SkIdItems integer 379 0.00 % ×
SkIdInsumo integer 1387 0.00 % ×
SkIdFecha integer 1354 0.00 %
SkIdTipoDescuento integer 2 0.00 %
Valro.Descuento numeric 3403 0.00 % ×
Cantidad.Descuento numeric 1136 0.00 % ×
Total.Descuento numeric 4866 0.00 % ×

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdProyecto

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 67
Median 100200
1st and 3rd quartiles 100108; 100226
Min. and max. 1003; 100278

  • Note that the following possible outlier values were detected: "1003", "1005", "1006", "1009", "10011", …, "100269", "100275", "100276", "100277", "100278" (23 values omitted).

SkIdTercero

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 328
Median 7292
1st and 3rd quartiles 5141; 11938
Min. and max. 22; 12701


SkIdEspecificacionDeContratos

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 982
Median 1002002000005
1st and 3rd quartiles 1001081080029; 1002262260083
Min. and max. 100330016; 1002782780003

  • Note that the following possible outlier values were detected: "100330016", "100330029", "100330045", "100330065", "100330074", …, "1002752750100", "1002762760065", "1002772770057", "1002782780002", "1002782780003" (460 values omitted).

SkIdEspecificacionActas

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 2238
Median 10024024000345
1st and 3rd quartiles 10011111100059; 100188188000640
Min. and max. 1003300163; 1002282280489359

  • Note that the following possible outlier values were detected: "1003300163", "1003300164", "1003300657", "1003300749", "1003300772", …, "1003535023511", "1003535023512", "1003535023517", "1003535023518", "1003535023610" (497 values omitted).

SkIdItems

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 379
Median 10067005
1st and 3rd quartiles 10027175; 10083085
Min. and max. 1002553; 100144871

  • Note that the following possible outlier values were detected: "1002553", "1002647", "1003061", "1003289", "1005207", …, "100126038", "100127286", "100139616", "100140849", "100144871" (82 values omitted).

SkIdInsumo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 1387
Median 1004830.5
1st and 3rd quartiles 1001957; 1008966
Min. and max. 100106; 10019101

  • Note that the following possible outlier values were detected: "100106", "100116", "100119", "100139", "100140", …, "10018180", "10018199", "10018938", "10018946", "10019101" (571 values omitted).

SkIdFecha

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 1354
Median 20191113
1st and 3rd quartiles 20160406.75; 20230203
Min. and max. 20110429; 20251031


SkIdTipoDescuento

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “1”
Reference category 1


Valro.Descuento

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 3403
Median 40000
1st and 3rd quartiles 5000; 375982
Min. and max. 0; 8434389076

  • Note that the following possible outlier values were detected: "6960000", "6978874", "7e+06", "7003962", "7072325", …, "161412309", "219080642", "224454725.46", "227339743.59", "8434389076" (121 values omitted).

Cantidad.Descuento

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 1136
Median 4
1st and 3rd quartiles 1; 30
Min. and max. 0; 103757824

  • Note that the following possible outlier values were detected: "590", "596", "598", "600", "601.16", …, "387157", "458900", "750000", "2885231.71", "103757824" (223 values omitted).

Total.Descuento

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 4866
Median 360000
1st and 3rd quartiles 83889.01; 1291389.6
Min. and max. 0; 224454725.46

  • Note that the following possible outlier values were detected: "12844752.76", "12969400", "12992000", "1.3e+07", "13003061", …, "161412309", "190950000", "197477060", "219080642", "224454725.46" (136 values omitted).

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:30:56

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_FACT.Anticipo

The dataset examined has the following dimensions:

Feature Result
Number of observations 1461
Number of variables 11

Checks performed

The following variable checks were performed, depending on the data type of each variable:

  character factor labelled haven labelled numeric integer logical Date
Identify miscoded missing values × × × × × × ×
Identify prefixed and suffixed whitespace × × × ×
Identify levels with < 6 obs. × × × ×
Identify case issues × × × ×
Identify misclassified numeric or integer variables × × × ×
Identify outliers × × ×

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdEmpresa integer 1 0.00 % ×
SkIdProyecto integer 66 0.00 %
SkIdTercero integer 261 0.00 % ×
SkIdFechaAnticipo integer 916 0.00 %
SkIdFechaPago numeric 596 26.97 %
SkIdUsuario integer 36 0.00 % ×
SkIdEstado integer 5 0.00 % ×
Anticipo.Numero integer 1360 0.00 %
Porcentaje.Amortizado numeric 15 0.00 % ×
Valor.Anticipo numeric 1168 0.00 % ×
Factura character 527 0.00 % ×

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdProyecto

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 66
Median 100157
1st and 3rd quartiles 10031; 100225
Min. and max. 1003; 100275


SkIdTercero

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 261
Median 5061
1st and 3rd quartiles 4814; 8624
Min. and max. 155; 12678

  • Note that the following possible outlier values were detected: "155", "205", "300", "325", "407", …, "4463", "4515", "4553", "4606", "4610" (42 values omitted).

SkIdFechaAnticipo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 916
Median 20181109
1st and 3rd quartiles 20150414; 20230216
Min. and max. 20110709; 20251027


SkIdFechaPago

Feature Result
Variable type numeric
Number of missing obs. 394 (26.97 %)
Number of unique values 595
Median 20191214
1st and 3rd quartiles 20161021; 20231006.5
Min. and max. 20110728; 20251119
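
SkIdFechaPago appears to hold dates encoded as YYYYMMDD numbers (an assumption based on the values shown), so order statistics computed on the raw numbers can fall between valid keys, as the third quartile 20231006.5 does. The Python sketch below converts such keys to real dates before summarising. `parse_date_key` is a hypothetical helper, and the sample list is made up from a few values quoted in the table above plus one missing entry.

```python
from datetime import date, datetime
from typing import Optional


def parse_date_key(key: Optional[float]) -> Optional[date]:
    """Convert a YYYYMMDD number to a date; missing or invalid keys give None."""
    if key is None:
        return None
    try:
        if key != int(key):
            return None  # fractional keys such as 20231006.5 are not dates
        return datetime.strptime(str(int(key)), "%Y%m%d").date()
    except (ValueError, OverflowError):
        return None


keys = [20110728, 20161021, None, 20191214, 20251119, 20231006.5]
dates = sorted(d for d in map(parse_date_key, keys) if d is not None)
print(dates[0], dates[-1])
```

Quartiles taken over `dates` (for example by position in the sorted list) are always real calendar days, which is not guaranteed when the raw keys are averaged.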


SkIdUsuario

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 36
Median 100203
1st and 3rd quartiles 100149; 100268
Min. and max. 10070; 100499

  • Note that the following possible outlier values were detected: "10070".

SkIdEstado

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 5
Mode “10033”
Reference category 10030

  • Note that the following levels have at most five observations: "10031".

Anticipo.Numero

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 1360
Median 924
1st and 3rd quartiles 331; 1398
Min. and max. 1; 1790


Porcentaje.Amortizado

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 15
Median 100
1st and 3rd quartiles 100; 100
Min. and max. 0; 100

  • Note that the following possible outlier values were detected: "0", "20", "30", "40", "42", …, "80", "90.48", "97", "98", "99" (4 values omitted).
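
Porcentaje.Amortizado has identical first and third quartiles (100; 100), so any rule built on the interquartile range has zero width to work with. The sketch below uses a plain Tukey rule with 1.5 × IQR fences; this rule is an assumption chosen for illustration and may differ from the one dataMaid applies. The sample data are invented.

```python
from statistics import quantiles
from typing import List, Set


def tukey_outliers(values: List[float], k: float = 1.5) -> Set[float]:
    """Return the distinct values outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return {v for v in values if v < low or v > high}


# Mostly fully amortised advances plus a few partial ones (made-up sample).
sample = [100.0] * 10 + [0.0, 20.0, 99.0]
flagged = tukey_outliers(sample)
print(sorted(flagged))
```

When Q1 equals Q3 the fences collapse onto a single value, so every observation different from 100 is flagged. Flags of this kind describe the shape of the distribution rather than data errors.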

Valor.Anticipo

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 1168
Median 24197068
1st and 3rd quartiles 4973442.82; 83664448
Min. and max. -5e+08; 4724216985

  • Note that the following possible outlier values were detected: "-5e+08", "-275783015", "-267004992", "-115294457", "-73703250", …, "2.5e+09", "2533506204", "3e+09", "3845092554", "4724216985" (53 values omitted).

Factura

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 527
Mode “”

  • The following suspected missing value codes enter as regular values: "".

  • The following values appear with prefixed or suffixed white space: " 02 08 - 2021", "42472 ", "46972 ", "AJUSTE ANTICIPO ", "ajuste anticipo por ", …, "PROFORMA ", "PROFORMA No: 0811 - ", "PROFORMA No: 2402 - ", "PROFORMA No: 2709 - ", "Traslado de menta a " (12 values omitted).

  • Note that the following levels have at most five observations: " 02 08 - 2021", "0001", "01", "010-2023", "010-23", …, "Traslado Terranum", "TrasladoMenta", "VIENE DE ETAPA 1", "WP982024", "YFA-0105" (516 values omitted).

  • Note that there might be case problems with the following levels: "Cuenta de Cobro No. ", "CUENTA DE COBRO NO. ".
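
The four notes above on Factura (empty strings used as values, surrounding whitespace, rare levels, and case variants) can interact: trimming whitespace and normalising case merges some of the rare levels. A minimal Python sketch of such a clean-up follows, using a few values quoted in the notes. The helper names `clean_text` and `case_variants` are hypothetical, and treating the empty string as missing is an assumption about the intended meaning.

```python
from collections import defaultdict
from typing import Dict, List, Optional, Set


def clean_text(value: str) -> Optional[str]:
    """Strip surrounding whitespace; map empty strings to None (missing)."""
    stripped = value.strip()
    return stripped if stripped else None


def case_variants(values: List[str]) -> Dict[str, Set[str]]:
    """Group cleaned values that differ only by letter case."""
    groups: Dict[str, Set[str]] = defaultdict(set)
    for raw in values:
        cleaned = clean_text(raw)
        if cleaned is not None:
            groups[cleaned.casefold()].add(cleaned)
    return {key: forms for key, forms in groups.items() if len(forms) > 1}


raw_values = ["", " 02 08 - 2021", "42472 ", "Cuenta de Cobro No. ", "CUENTA DE COBRO NO. "]
cleaned = [clean_text(v) for v in raw_values]
variants = case_variants(raw_values)
print(cleaned)
print(variants)
```

The sketch only reports the case variants; which spelling to keep is a decision for whoever owns the data.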


Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:31:01



ADP_DTM_FACT.AuditoriaActas

The dataset examined has the following dimensions:

Feature Result
Number of observations 46144
Number of variables 7

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

Variable            class      # unique values  Missing observations  Any problems?
SkIdEmpresa         integer    1                0.00 %                ×
SkIdProyecto        integer    56               0.00 %                ×
SkIdFecha           integer    1561             0.00 %
SkIdUsuario         integer    34               0.00 %                ×
No..Contrato        integer    4989             0.00 %                ×
No..Acta            integer    385              0.00 %                ×
Descripcion.Estado  character  7                0.00 %

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdProyecto

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 56
Median 100211
1st and 3rd quartiles 100201; 100236
Min. and max. 10029; 100294

  • Note that the following possible outlier values were detected: "10029", "10031", "10035", "100117", "100118", …, "100170", "100171", "100173", "100174", "100184" (8 values omitted).

SkIdFecha

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 1561
Median 20230629
1st and 3rd quartiles 20211230; 20240911
Min. and max. 20191118; 20251031


SkIdUsuario

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 34
Median 100187
1st and 3rd quartiles 100141; 100324
Min. and max. 100103; 100512

  • Note that the following possible outlier values were detected: "100103".

No..Contrato

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 4989
Median 2110065.5
1st and 3rd quartiles 2010013; 2360038
Min. and max. 290185; 2940002

  • Note that the following possible outlier values were detected: "290185", "290199", "290206", "290207", "290208", …, "1840260", "1840261", "1840262", "1840263", "1840264" (954 values omitted).

No..Acta

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 385
Median 4
1st and 3rd quartiles 2; 10
Min. and max. 1; 385

  • Note that the following possible outlier values were detected: "64", "65", "66", "67", "68", …, "381", "382", "383", "384", "385" (312 values omitted).

Descripcion.Estado

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 7
Mode “Programación de Actas”


Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:31:04



ADP_DTM_FACT.AuditoriaContratos

The dataset examined has the following dimensions:

Feature Result
Number of observations 41595
Number of variables 6

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

Variable            class      # unique values  Missing observations  Any problems?
SkIdEmpresa         integer    1                0.00 %                ×
SkIdProyecto        integer    81               0.00 %                ×
SkIdFecha           integer    3661             0.00 %                ×
SkIdUsuario         integer    136              0.00 %                ×
No..Contrato        integer    11950            0.00 %                ×
Descripcion.Estado  character  10               0.00 %                ×

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdProyecto

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 81
Median 100210
1st and 3rd quartiles 100188; 100236
Min. and max. 1003; 100295

  • Note that the following possible outlier values were detected: "1003", "1005", "1006", "1009", "10011", …, "10031", "10035", "100288", "100294", "100295" (7 values omitted).

SkIdFecha

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 3661
Median 20230310
1st and 3rd quartiles 20210212; 20240706
Min. and max. 20110616; 20251031

  • Note that the following possible outlier values were detected: "20250102", "20250103", "20250104", "20250107", "20250108", …, "20251027", "20251028", "20251029", "20251030", "20251031" (231 values omitted).
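
All SkIdFecha values flagged here lie between 20250102 and 20251031, which suggests the flags come from how the dates are encoded rather than from bad records. If the column stores dates as YYYYMMDD integers (an assumption), consecutive calendar days are not evenly spaced on the integer scale, and that can distort distance-based checks. A small Python sketch of the effect; the helper name `key_to_date` is hypothetical.

```python
from datetime import date


def key_to_date(key: int) -> date:
    """Split a YYYYMMDD integer into year, month and day."""
    year, rest = divmod(key, 10000)
    month, day = divmod(rest, 100)
    return date(year, month, day)


before, after = 20241231, 20250101
key_gap = after - before                                   # gap on the integer scale
day_gap = (key_to_date(after) - key_to_date(before)).days  # gap in calendar days
print(key_gap, day_gap)
```

One calendar day across a year boundary shows up as a jump of several thousand units in the key, so converting the column to a date type before running outlier checks is likely to give more meaningful flags.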

SkIdUsuario

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 136
Median 100210
1st and 3rd quartiles 100141; 100324
Min. and max. 10051; 100514

  • Note that the following possible outlier values were detected: "10051", "10069", "10070", "10073", "10081", "10086".

No..Contrato

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 11950
Median 2100212
1st and 3rd quartiles 1880148; 2360002
Min. and max. 30001; 2950003

  • Note that the following possible outlier values were detected: "30001", "30002", "30003", "30004", "30005", …, "350437", "350438", "350439", "350440", "350441" (3110 values omitted).

Descripcion.Estado

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 10
Mode “CREACION”

  • Note that the following levels have at most five observations: "Desaprobado por apertura de grupo".
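
The check behind this note counts how often each level occurs and lists those seen at most five times. A minimal Python sketch of that count follows, using invented frequencies for two of the levels named in this section. The threshold of five mirrors the note, while `rare_levels` is a hypothetical helper rather than part of dataMaid.

```python
from collections import Counter
from typing import Iterable, List


def rare_levels(values: Iterable[str], max_count: int = 5) -> List[str]:
    """Return the levels that occur at most max_count times, sorted by name."""
    counts = Counter(values)
    return sorted(level for level, n in counts.items() if n <= max_count)


# Made-up frequencies; only the level names come from the report.
states = ["CREACION"] * 8 + ["Desaprobado por apertura de grupo"] * 2
rare = rare_levels(states)
print(rare)
```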

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:31:07



ADP_DTM_FACT.AuditoriaEntradasAlmacen

The dataset examined has the following dimensions:

Feature Result
Number of observations 31420
Number of variables 6

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

Variable            class      # unique values  Missing observations  Any problems?
SkIdEmpresa         integer    1                0.00 %                ×
SkIdProyecto        integer    47               0.00 %                ×
SkIdFecha           integer    1124             0.00 %
SkIdUsuario         integer    29               0.00 %
No..Entrada         integer    29188            0.00 %                ×
Descripcion.Estado  character  7                0.00 %

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdProyecto

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 47
Median 100217
1st and 3rd quartiles 100204; 100230
Min. and max. 10029; 100294

  • Note that the following possible outlier values were detected: "10029", "10031", "10035", "100118", "100132", …, "100170", "100171", "100173", "100174", "100294" (6 values omitted).

SkIdFecha

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 1124
Median 20230928
1st and 3rd quartiles 20220526; 20240903
Min. and max. 20191118; 20251031


SkIdUsuario

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 29
Median 100213
1st and 3rd quartiles 100141; 100349
Min. and max. 100103; 100463


No..Entrada

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 29188
Median 21700788.5
1st and 3rd quartiles 20400400; 23000283
Min. and max. 290587; 29400005

  • Note that the following possible outlier values were detected: "290587", "290588", "290589", "290590", "290591", …, "29400001", "29400002", "29400003", "29400004", "29400005" (7258 values omitted).

Descripcion.Estado

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 7
Mode “Programación y Aprobación de Entradas de Almacén”


Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:31:09



ADP_DTM_FACT.AuditoriaPedidos

The dataset examined has the following dimensions:

Feature Result
Number of observations 364087
Number of variables 8

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

Variable      class      # unique values  Missing observations  Any problems?
SkIdEmpresa   integer    1                0.00 %                ×
SkIdProyecto  integer    59               0.00 %                ×
SkIdPedido    integer    55504            0.00 %                ×
SkIdInsumo    integer    5581             0.00 %                ×
SkIdFecha     integer    1996             0.00 %                ×
SkIdUsuario   integer    83               0.00 %                ×
SkIdEstado    integer    7                0.00 %                ×
EventoPedido  character  5                0.00 %

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdProyecto

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 59
Median 100217
1st and 3rd quartiles 100211; 100239
Min. and max. 1006; 100295

  • Note that the following possible outlier values were detected: "1006", "1009", "10021", "10029", "10030", …, "100171", "100173", "100174", "100184", "100188" (16 values omitted).

SkIdPedido

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 55504
Median 100117732
1st and 3rd quartiles 100102537; 100131545
Min. and max. 10019331; 100141328

  • Note that the following possible outlier values were detected: "10019331", "10026009", "10027787", "10027788", "10027789", …, "100141324", "100141325", "100141326", "100141327", "100141328" (20810 values omitted).

SkIdInsumo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 5581
Median 1003590
1st and 3rd quartiles 1001985; 10010498
Min. and max. 100101; 10019323

  • Note that the following possible outlier values were detected: "100101", "100106", "100107", "100108", "100109", …, "100992", "100994", "100997", "100998", "100999" (363 values omitted).

SkIdFecha

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 1996
Median 20240221
1st and 3rd quartiles 20230118; 20241112
Min. and max. 20131024; 20251031

  • Note that the following possible outlier values were detected: "20250102", "20250103", "20250107", "20250108", "20250109", …, "20251027", "20251028", "20251029", "20251030", "20251031" (232 values omitted).

SkIdUsuario

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 83
Median 100287
1st and 3rd quartiles 100230; 100370
Min. and max. 100103; 100513

  • Note that the following possible outlier values were detected: "100103", "100110".

SkIdEstado

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 7
Median 10073
1st and 3rd quartiles 10073; 10073
Min. and max. -10075; 10073

  • Note that the following possible outlier values were detected: "-10075", "-10072", "-10071", "10070", "10071", "10072".

EventoPedido

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 5
Mode “Creacion”


Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:31:14



ADP_DTM_FACT.Compras

The dataset examined has the following dimensions:

Feature Result
Number of observations 128467
Number of variables 20

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

Variable                  class    # unique values  Missing observations  Any problems?
SkIdEmpresa               integer  1                0.00 %                ×
SkIdProyecto              integer  72               0.00 %                ×
SkIdTercero               integer  916              0.00 %                ×
SkIdFechaCompra           integer  3678             0.00 %
SkIdFechaEntrega          integer  4048             0.00 %                ×
SkIdFechaCierre           numeric  1505             70.32 %
SkIdEstado                integer  7                0.00 %                ×
SkIdInsumo                integer  7739             0.00 %                ×
SkIdItems                 integer  14591            0.00 %                ×
SkIdUsuario               integer  45               0.00 %                ×
SkIdOrigenDelDocumento    integer  4                0.00 %
SkIdEstadoEnvioDocumento  integer  3                0.00 %
Compra.No                 integer  28261            0.00 %
Cantidad.Comprada         numeric  21068            0.00 %                ×
Valor.Unitario            numeric  22720            0.00 %                ×
IVA                       numeric  6                0.00 %
Descuento                 numeric  105              0.00 %                ×
Valor.Neto                numeric  24597            0.00 %                ×
Valor.IVA                 numeric  59543            0.00 %                ×
Valor.Total               numeric  65267            0.00 %                ×

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdProyecto

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 72
Median 100184
1st and 3rd quartiles 100108; 100226
Min. and max. 1003; 100295

  • Note that the following possible outlier values were detected: "1003", "1005", "1006", "1009", "10011", …, "100277", "100278", "100283", "100294", "100295" (26 values omitted).

SkIdTercero

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 916
Median 5674
1st and 3rd quartiles 4463; 8972
Min. and max. 71; 12696

  • Note that the following possible outlier values were detected: "71", "85", "98", "126", "155", …, "2531", "2534", "2535", "2536", "2546" (152 values omitted).

SkIdFechaCompra

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 3678
Median 20190718
1st and 3rd quartiles 20151007; 20231023
Min. and max. 20110616; 20251031


SkIdFechaEntrega

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 4048
Median 20190518
1st and 3rd quartiles 20150912; 20231003
Min. and max. 19000101; 20430702

  • Note that the following possible outlier values were detected: "19000101", "20360709", "20400316", "20430702".
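
The four flagged SkIdFechaEntrega values sit far outside the range of the purchase dates in the same table (20110616 to 20251031): 19000101 looks like a placeholder date, and the other three lie years in the future. Both readings are guesses; the report does not say how these values arose. The Python sketch below flags keys outside a chosen window. The window bounds and the name `outside_window` are assumptions for illustration.

```python
from typing import Iterable, List


def outside_window(keys: Iterable[int], first: int, last: int) -> List[int]:
    """Return YYYYMMDD keys that fall before `first` or after `last`."""
    return [k for k in keys if k < first or k > last]


# Quartiles, median and flagged values quoted in the table and note above.
delivery_keys = [19000101, 20150912, 20190518, 20231003, 20360709, 20400316, 20430702]
suspect = outside_window(delivery_keys, first=20110101, last=20261231)
print(suspect)
```

Plain integer comparison is enough here because YYYYMMDD keys sort in the same order as the dates they encode.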

SkIdFechaCierre

Feature Result
Variable type numeric
Number of missing obs. 90336 (70.32 %)
Number of unique values 1504
Median 20200904
1st and 3rd quartiles 20170124; 20240514
Min. and max. 20110705; 20251029


SkIdEstado

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 7
Median 10023
1st and 3rd quartiles 10023; 10024
Min. and max. 10020; 10027

  • Note that the following possible outlier values were detected: "10020", "10021", "10022".

SkIdInsumo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 7739
Median 1004391
1st and 3rd quartiles 1001664; 1009023
Min. and max. 100101; 10019323

  • Note that the following possible outlier values were detected: "100101", "100106", "100107", "100108", "100109", …, "10019275", "10019320", "10019321", "10019322", "10019323" (4070 values omitted).

SkIdItems

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 14591
Median 10062373
1st and 3rd quartiles 10027093; 10089774
Min. and max. 100; 100145625

  • Note that the following possible outlier values were detected: "100", "1002499", "1002503", "1002508", "1002510", …, "100144960", "100144961", "100144962", "100144963", "100145625" (4015 values omitted).

SkIdUsuario

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 45
Median 100230
1st and 3rd quartiles 100164; 100287
Min. and max. 10048; 100499

  • Note that the following possible outlier values were detected: "10048", "100390", "100400", "100407", "100411", …, "100428", "100432", "100440", "100444", "100499" (1 value omitted).

SkIdOrigenDelDocumento

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 4
Mode “4”
Reference category 3


SkIdEstadoEnvioDocumento

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 3
Mode “-1”
Reference category -1


Compra.No

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 28261
Median 13400053
1st and 3rd quartiles 1150065; 21700588
Min. and max. 30001; 29500001


Cantidad.Comprada

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 21068
Median 10
1st and 3rd quartiles 2; 80
Min. and max. 0; 1811700.43

  • Note that the following possible outlier values were detected: "1442.3", "1442.65", "1443", "1444.91", "1445.7", …, "380750.8", "409500", "409740", "433836", "1811700.43" (5578 values omitted).

Valor.Unitario

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 22720
Median 11623
1st and 3rd quartiles 3150; 59980
Min. and max. 0; 1087483487

  • Note that the following possible outlier values were detected: "857059", "857143", "860000", "862093", "863200", …, "266874835", "284078700", "357817674.13", "763962418", "1087483487" (1602 values omitted).

IVA

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 6
Median 0.19
1st and 3rd quartiles 0.16; 0.19
Min. and max. 0; 0.19


Descuento

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 105
Median 0
1st and 3rd quartiles 0; 0
Min. and max. 0; 1

  • Note that the following possible outlier values were detected: "0.01", "0.01", "0.02", "0.02", "0.02", …, "0.65", "0.67", "0.83", "0.83", "1" (94 values omitted).

Valor.Neto

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 24597
Median 11400
1st and 3rd quartiles 3076.64; 58805
Min. and max. 0; 1087483487

  • Note that the following possible outlier values were detected: "845000", "845203", "845270", "846198.9", "846303", …, "266874835", "284078700", "357817674.13", "763962418", "1087483487" (1640 values omitted).

Valor.IVA

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 59543
Median 27184.25
1st and 3rd quartiles 3192; 199972.98
Min. and max. 0; 1170358477.78

  • Note that the following possible outlier values were detected: "3478777.83", "3480000", "3480203.06", "3481484", "3481558.08", …, "488161715.72", "488734176", "563811533.33", "730710000", "1170358477.78" (3274 values omitted).

Valor.Total

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 65267
Median 237800
1st and 3rd quartiles 37096.8; 1527138.9
Min. and max. 0; 7330139939.78

  • Note that the following possible outlier values were detected: "24816141", "24816736", "24823066.8", "24826843.86", "24828445.6", …, "3057433903.72", "3531240656.12", "3543322776", "5297647500", "7330139939.78" (3219 values omitted).

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:31:21



ADP_DTM_FACT.Contrato

The dataset examined has the following dimensions:

Feature Result
Number of observations 143120
Number of variables 20

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

Variable                           class      # unique values  Missing observations  Any problems?
SkIdEmpresa                        integer    1                0.00 %                ×
Empresa                            character  1                0.00 %                ×
SkIdProyecto                       integer    81               0.00 %                ×
SkIdTercero                        integer    1632             0.00 %
SkIdInsumo                         integer    6046             0.00 %                ×
SkIdItems                          integer    26662            0.00 %                ×
SkIdTipoContrato                   integer    53               0.00 %                ×
SKIdEstado                         integer    4                0.00 %
SkIdVariablesAdicionalesContratos  integer    11916            0.00 %                ×
SkIdEspecificacionDeContratos      numeric    11916            0.00 %                ×
Cantidad.Inicial                   numeric    13664            0.00 %                ×
Cantidad                           numeric    21067            0.00 %                ×
Valor.Unitario                     numeric    57008            0.00 %                ×
Valor.Iva                          numeric    36717            0.00 %                ×
Valor.Total                        numeric    79947            0.00 %                ×
Valor.Contrato.Sin.IVA             numeric    6541             31.10 %               ×
Valor.Contrato                     numeric    9023             1.99 %                ×
Numero.De.Grupo                    integer    470              0.00 %                ×
Valor.Detalle                      numeric    69048            0.00 %                ×
Valor.Detalle.Unitario             numeric    60252            0.00 %                ×

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

Empresa

  • The variable only takes one (non-missing) value: "ARPRO ARQUITECTOS INGENIEROS S.A.S". The variable contains 0 % missing observations.

SkIdProyecto

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 81
Median 100200
1st and 3rd quartiles 100111; 100228
Min. and max. 1003; 100295

  • Note that the following possible outlier values were detected: "1003", "1005", "1006", "1009", "10011", …, "100281", "100283", "100288", "100294", "100295" (35 values omitted).

SkIdTercero

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 1632
Median 9058
1st and 3rd quartiles 5082; 11778
Min. and max. 1; 12708


SkIdInsumo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 6046
Median 1006715
1st and 3rd quartiles 1002351; 1009361
Min. and max. 100; 10019348

  • Note that the following possible outlier values were detected: "100", "100101", "100105", "100106", "100107", …, "10019344", "10019345", "10019346", "10019347", "10019348" (3309 values omitted).

SkIdItems

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 26662
Median 10067747
1st and 3rd quartiles 10028783; 10082227
Min. and max. 100; 100145469

  • Note that the following possible outlier values were detected: "100", "100101", "1002462", "1002463", "1002464", …, "100145446", "100145447", "100145448", "100145468", "100145469" (7531 values omitted).

SkIdTipoContrato

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 53
Median 25
1st and 3rd quartiles 22; 47
Min. and max. 2; 63

  • Note that the following possible outlier values were detected: "2", "3", "4", "5", "6", …, "11", "13", "15", "17", "18" (3 values omitted).

SKIdEstado

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 4
Mode “10011”
Reference category 10010


SkIdVariablesAdicionalesContratos

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 11916
Median 1002000103
1st and 3rd quartiles 1001110044; 1002280082
Min. and max. 10030001; 1002950003

  • Note that the following possible outlier values were detected: "10030001", "10030002", "10030003", "10030004", "10030005", …, "1002940002", "1002940003", "1002950001", "1002950002", "1002950003" (5452 values omitted).

SkIdEspecificacionDeContratos

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 11916
Median 1002002000103
1st and 3rd quartiles 1001111110044; 1002282280082
Min. and max. 100330001; 1002952950003

  • Note that the following possible outlier values were detected: "100330001", "100330002", "100330003", "100330004", "100330005", …, "1002942940002", "1002942940003", "1002952950001", "1002952950002", "1002952950003" (5452 values omitted).
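
SkIdEspecificacionDeContratos is the only surrogate key in this table reported as numeric rather than integer. Its largest value, 1002952950003, exceeds what a 32-bit signed integer can hold, which is likely why it was read as a double (an inference, not something the report states). A short Python sketch of the two range checks involved; the helper names are hypothetical.

```python
INT32_MAX = 2**31 - 1     # largest 32-bit signed integer
DOUBLE_EXACT_MAX = 2**53  # integers up to this magnitude are exact in a 64-bit float


def fits_int32(value: int) -> bool:
    """True when the value fits in a 32-bit signed integer."""
    return -INT32_MAX - 1 <= value <= INT32_MAX


def exact_as_double(value: int) -> bool:
    """True when the integer is exactly representable as a 64-bit float."""
    return abs(value) <= DOUBLE_EXACT_MAX


# Maximum values quoted in the two variable tables above.
keys = {
    "SkIdVariablesAdicionalesContratos": 1002950003,
    "SkIdEspecificacionDeContratos": 1002952950003,
}
report = {name: (fits_int32(v), exact_as_double(v)) for name, v in keys.items()}
print(report)
```

Keys of this size are still stored exactly as doubles, but outlier statistics on them carry no meaning, and treating the column as an identifier (for example as text) may be the safer choice.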

Cantidad.Inicial

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 13664
Median 0
1st and 3rd quartiles 0; 4
Min. and max. 0; 28572949385.16

  • Note that the following possible outlier values were detected: "124.58", "124.66", "124.69", "124.73", "124.77", …, "15001503196.54", "15017221162.63", "15778660329.18", "16209633178.65", "28572949385.16" (8903 values omitted).

Cantidad

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 21067
Median 1
1st and 3rd quartiles 1; 23.6
Min. and max. -1236734210; 16209633178.65

  • Note that the following possible outlier values were detected: "-1236734210", "-853086836", "-735144947.28", "-287984386.38", "-158618945", …, "14842344039.45", "14968813010.63", "15226444981", "15778660260", "16209633178.65" (8001 values omitted).

Valor.Unitario

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 57008
Median 99365
1st and 3rd quartiles 18228.6; 981979.19
Min. and max. -13240606.52; 30420118402.79

  • Note that the following possible outlier values were detected: "-13240606.52", "20156772", "20175017.18", "20175017.28", "20191186", …, "1869220608.09", "3736726701.74", "4103141026", "4959563705.23", "30420118402.79" (2702 values omitted).

Valor.Iva

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 36717
Median 107.22
1st and 3rd quartiles 0; 5179.91
Min. and max. 0; 779596794.94

  • Note that the following possible outlier values were detected: "145109.09", "145120", "145223.5", "145240.94", "145302.5", …, "188698247.49", "210046338.38", "215221041.37", "282350244.8", "779596794.94" (5913 values omitted).

Valor.Total

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 79947
Median 665512.47
1st and 3rd quartiles 71400; 3761120.75
Min. and max. -1236734210; 16981211717.95

  • Note that the following possible outlier values were detected: "-1236734210", "-853086836", "-766988485.82", "-619738612.5", "-549686289.32", …, "15226444981", "15485255013.86", "15617202115", "16462128707.82", "16981211717.95" (4932 values omitted).

Valor.Contrato.Sin.IVA

Feature Result
Variable type numeric
Number of missing obs. 44517 (31.1 %)
Number of unique values 6540
Median 90213429.44
1st and 3rd quartiles 19747220.84; 503972858.67
Min. and max. -0.03; 117924027174.96

  • Note that the following possible outlier values were detected: "8405609363", "8463006684.69", "8588437925.16", "8870630811.86", "9173639969", …, "21271989244.18", "21840487208.4", "22773002434", "91737737208.85", "117924027174.96" (20 values omitted).

Valor.Contrato

Feature Result
Variable type numeric
Number of missing obs. 2848 (1.99 %)
Number of unique values 9022
Median 50266796.39
1st and 3rd quartiles 1e+07; 4.5e+08
Min. and max. -1193561299; 118785779681.24

  • Note that the following possible outlier values were detected: "-1193561299", "-1165957002", "-853086836", "-599669736.63", "-259079072", …, "21412051724.8", "21963218776.81", "27099872896.46", "92349912228.67", "118785779681.24" (36 values omitted).

Numero.De.Grupo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 470
Median 0
1st and 3rd quartiles 0; 14
Min. and max. 0; 469

  • Note that the following possible outlier values were detected: "436", "437", "438", "439", "440", …, "465", "466", "467", "468", "469" (24 values omitted).

Valor.Detalle

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 69048
Median 160000
1st and 3rd quartiles 27000; 1285931.53
Min. and max. -1236734210; 30539808138.04

  • Note that the following possible outlier values were detected: "-1236734210", "-853086836", "-619738612.5", "-549686289.32", "-499917310.55", …, "3762322595.26", "4882737820.94", "4988171792", "4993535808.27", "30539808138.04" (3364 values omitted).

Valor.Detalle.Unitario

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 60252
Median 96158.98
1st and 3rd quartiles 17900; 901324
Min. and max. -13240606.52; 30539808138.04

  • Note that the following possible outlier values were detected: "-13240606.52", "18325098", "18326550", "18327575", "18331937.08", …, "4250691524.7", "4882737820.94", "4988171792", "4993535808.27", "30539808138.04" (2878 values omitted).

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:31:31

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_FACT.ContratosPolizas

The dataset examined has the following dimensions:

Feature Result
Number of observations 5770
Number of variables 11

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdEmpresa integer 1 0.00 % ×
SkIdProyecto integer 47 0.00 % ×
SkIdContrato numeric 1167 0.00 % ×
SkIdTercero integer 341 0.00 %
SkIdEstado integer 3 0.00 %
SkIdFechaVigenciaDesde integer 1075 0.00 % ×
SkIdFechaVigenciaHasta integer 2280 0.00 % ×
SkIdTipoPoliza integer 11 0.00 %
PolizaNumero character 2315 0.00 % ×
ValorAsegurado numeric 3832 0.00 % ×
PorcentajeAsegurado numeric 432 0.00 % ×

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdProyecto

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 47
Median 100210
1st and 3rd quartiles 100157; 100230
Min. and max. 1003; 100295

  • Note that the following possible outlier values were detected: "1003", "1005", "1009", "10012", "10013", …, "100278", "100279", "100283", "100294", "100295" (17 values omitted).

SkIdContrato

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 1167
Median 1002102100156
1st and 3rd quartiles 1001571570009; 1002302300247.75
Min. and max. 100330001; 1002952950001

  • Note that the following possible outlier values were detected: "100330001", "100330002", "100330003", "100330004", "100330006", …, "1002832830024", "1002942940001", "1002942940002", "1002942940003", "1002952950001" (476 values omitted).

SkIdTercero

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 341
Median 8434
1st and 3rd quartiles 5279; 11914
Min. and max. 22; 12704


SkIdEstado

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 3
Mode “100111”
Reference category -100111


SkIdFechaVigenciaDesde

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 1075
Median 20230115
1st and 3rd quartiles 20181115; 20240701
Min. and max. 19230821; 20270117

  • Note that the following possible outlier values were detected: "19230821", "20250101", "20250107", "20250110", "20250113", …, "20260818", "20260901", "20260930", "20261027", "20270117" (132 values omitted).
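The date keys above appear to be integers in yyyymmdd form, so "19230821" would read as 21 August 1923, roughly a century before the median. The following is a minimal Python sketch (the report itself was generated in R) that parses such keys and flags implausibly early dates. The helper name `parse_date_key` and the cutoff `earliest_plausible` are hypothetical; the yyyymmdd encoding is an assumption inferred from the printed values.

```python
from datetime import date, datetime


def parse_date_key(key):
    """Convert an integer such as 20230115 into a date (assumes yyyymmdd)."""
    return datetime.strptime(str(key), "%Y%m%d").date()


# Minimum, median and maximum of SkIdFechaVigenciaDesde as printed in the report.
keys = [19230821, 20230115, 20270117]
parsed = [parse_date_key(k) for k in keys]

# Hypothetical plausibility cutoff; the report does not state the real business limits.
earliest_plausible = date(2000, 1, 1)
suspicious = [d for d in parsed if d < earliest_plausible]

print(parsed)
print(suspicious)
```

Parsing the keys first means any later comparison works on calendar dates rather than on raw integers, where gaps between months and years distort distances.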

SkIdFechaVigenciaHasta

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 2280
Median 20240531
1st and 3rd quartiles 20191219.25; 20260612
Min. and max. 20110331; 20311027

  • Note that the following possible outlier values were detected: "20290116", "20290117", "20290123", "20290129", "20290130", …, "20310818", "20310901", "20310930", "20311005", "20311027" (146 values omitted).

SkIdTipoPoliza

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 11
Median 1004
1st and 3rd quartiles 1002; 10023
Min. and max. 1001; 10039


PolizaNumero

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 2315
Mode “1000171371001”

  • The following values appear with prefixed or suffixed white space: " 21-40-101177568", " 3780475-1", " BQ-100063888", " NB-100049724", " NB-100247249", "65-54-101006364 ", "CBO-100015876 ", "SEPL-23572249-1 ".

  • Note that the following levels have at most five observations: " 21-40-101177568", " 3780475-1", " BQ-100063888", " NB-100049724", " NB-100247249", …, "NB 100387945", "No 3120402", "SEPL-23572249-1", "SEPL-23572249-1 ", "SEPL10689104-1" (2257 values omitted).
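The padded values listed above matter because, for example, "SEPL-23572249-1" and "SEPL-23572249-1 " are counted as two different policy numbers. A minimal Python sketch of how the padding could be detected and removed follows; the sample values are copied from the report, while the helper name `find_padded` is hypothetical.

```python
# Sample values copied from the report; the helper name is hypothetical.
poliza_numero = [
    " 21-40-101177568",
    "65-54-101006364 ",
    "SEPL-23572249-1",
    "SEPL-23572249-1 ",
]


def find_padded(values):
    """Return the values that start or end with whitespace."""
    return [v for v in values if v != v.strip()]


padded = find_padded(poliza_numero)
cleaned = [v.strip() for v in poliza_numero]

print(padded)
# Distinct values before and after stripping.
print(len(set(poliza_numero)), "->", len(set(cleaned)))
```

In this small sample the number of distinct values drops from 4 to 3 once the padding is stripped, because the two SEPL entries collapse into one.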


ValorAsegurado

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 3832
Median 17918540.1
1st and 3rd quartiles 4663721.85; 69541714.7
Min. and max. 0; 54177206177.78

  • Note that the following possible outlier values were detected: "810494557.07", "811212691.04", "812601173.32", "816998715.9", "816998715.99", …, "4592107007", "5025958997.09", "6617217637", "10167540191", "54177206177.78" (85 values omitted).

PorcentajeAsegurado

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 432
Median 20
1st and 3rd quartiles 19.54; 30
Min. and max. 0; 205560.16

  • Note that the following possible outlier values were detected: "0", "0", "0", "0.25", "0.32", …, "95.49", "100", "826.63", "1003.29", "205560.16" (134 values omitted).
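If PorcentajeAsegurado is meant to be a percentage between 0 and 100 (an assumption; the report does not define the valid range), values such as "826.63", "1003.29" and "205560.16" would be out of range rather than merely unusual. A minimal Python sketch of such a range check, with a hypothetical helper `out_of_range` and values copied from the outlier list above:

```python
def out_of_range(values, low=0.0, high=100.0):
    """Return the values outside [low, high]; the bounds are assumed, not from the report."""
    return [v for v in values if not (low <= v <= high)]


# Values copied from the outlier note for PorcentajeAsegurado.
sample = [0, 0.25, 0.32, 95.49, 100, 826.63, 1003.29, 205560.16]
violations = out_of_range(sample)

print(violations)
```

A fixed range check of this kind complements the outlier note: it separates values that break a business rule from values that are simply far from the bulk of the data.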

Report generation information:

  • Report creation time: Sun Nov 02 2025 14:31:35



ADP_DTM_FACT.ControlProyecto

The dataset examined has the following dimensions:

Feature Result
Number of observations 1632800
Number of variables 13

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdEmpresa integer 1 0.00 % ×
Empresa character 1 0.00 % ×
SkIdProyecto integer 88 0.00 % ×
SkIdFecha integer 4579 0.00 % ×
SkIdClaseOrigen integer 22 0.00 % ×
SkIdInsumo integer 13322 0.00 % ×
SkIdCapitulo numeric 1996 0.00 %
SkIdItems numeric 47601 0.82 % ×
Cantidad numeric 226445 0.00 % ×
Valor.Total numeric 579815 0.00 % ×
Origen.Documento numeric 397538 0.10 % ×
Origen.Documento.Detalle integer 7394 0.00 % ×
Valor.Sin.IVA numeric 439582 13.11 % ×

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

Empresa

  • The variable only takes one (non-missing) value: "ARPRO ARQUITECTOS INGENIEROS S.A.S". The variable contains 0 % missing observations.

SkIdProyecto

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 88
Median 100174
1st and 3rd quartiles 100110; 100226
Min. and max. 1003; 100295

  • Note that the following possible outlier values were detected: "1003", "1005", "1006", "1009", "10011", …, "100281", "100283", "100288", "100294", "100295" (31 values omitted).

SkIdFecha

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 4579
Median 20180831
1st and 3rd quartiles 20140416; 20230502
Min. and max. 19000101; 20260201

  • Note that the following possible outlier values were detected: "19000101".

SkIdClaseOrigen

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 22
Median 13
1st and 3rd quartiles 10; 29
Min. and max. 1; 33

  • Note that the following possible outlier values were detected: "1", "2", "3", "5", "6", "7".

SkIdInsumo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 13322
Median 1005402
1st and 3rd quartiles 1001985; 1008730
Min. and max. 100101; 10019356

  • Note that the following possible outlier values were detected: "100101", "100105", "100106", "100107", "100108", …, "10019352", "10019353", "10019354", "10019355", "10019356" (7531 values omitted).

SkIdCapitulo

Feature Result
Variable type numeric
Number of missing obs. 38 (0 %)
Number of unique values 1995
Median 1001712228
1st and 3rd quartiles 100351175; 1002263096
Min. and max. -910024120; 1002954543


SkIdItems

Feature Result
Variable type numeric
Number of missing obs. 13438 (0.82 %)
Number of unique values 47600
Median 10064713
1st and 3rd quartiles 10028648; 10085206
Min. and max. 100; 100145937

  • Note that the following possible outlier values were detected: "100", "1000", "100101", "1002462", "1002463", …, "100145875", "100145876", "100145877", "100145936", "100145937" (18217 values omitted).

Cantidad

Feature Result
Variable type numeric
Number of missing obs. 35 (0 %)
Number of unique values 226444
Median 4
1st and 3rd quartiles 1; 49
Min. and max. -8.145313e+13; 8.145324e+13

  • Note that the following possible outlier values were detected: "-8.145313e+13", "-173596289527.98", "-129549469797", "-108497681110.76", "-107957891652.5", …, "107957892710", "108497682173.55", "129549471252", "173596291477.68", "8.145324e+13" (81942 values omitted).

Valor.Total

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 579815
Median 288494.73
1st and 3rd quartiles 24584.4; 2133066.75
Min. and max. -521909894612433; 521909894612433

  • Note that the following possible outlier values were detected: "-521909894612433", "-516969750214325", "-493906259178344", "-474733570021391", "-443849906571097", …, "443849906571097", "474733570495248", "493906259178344", "516969755846935", "521909894612433" (107275 values omitted).
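The minimum and maximum of Valor.Total are exact mirror images (-521909894612433 and 521909894612433), and several other listed extremes also appear with both signs. One possible reading, which is an assumption and not something the report states, is that some large entries are cancelled by an opposite entry. A minimal Python sketch that finds values whose exact negation also occurs, using the ten values printed above and a hypothetical helper `offsetting_values`:

```python
def offsetting_values(values):
    """Return the positive values whose exact negation also appears in the input."""
    seen = set(values)
    return sorted(v for v in seen if v > 0 and -v in seen)


# The ten extreme values printed in the outlier note for Valor.Total.
extremes = [
    -521909894612433, -516969750214325, -493906259178344,
    -474733570021391, -443849906571097,
    443849906571097, 474733570495248, 493906259178344,
    516969755846935, 521909894612433,
]
pairs = offsetting_values(extremes)

print(pairs)
```

Only three of the five positive extremes have an exact negative counterpart in this list; the other two differ in their lower digits, so they would need a tolerance-based comparison or a manual review.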

Origen.Documento

Feature Result
Variable type numeric
Number of missing obs. 1653 (0.1 %)
Number of unique values 397537
Median 118986
1st and 3rd quartiles 27984; 243217
Min. and max. -1; 174000235

  • Note that the following possible outlier values were detected: "1080000", "1080001", "1080002", "1080003", "1080004", …, "174000231", "174000232", "174000233", "174000234", "174000235" (41548 values omitted).

Origen.Documento.Detalle

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 7394
Median -1
1st and 3rd quartiles -1; -1
Min. and max. -1; 86399

  • Note that the following possible outlier values were detected: "0", "21", "22", "23", "24", …, "86192", "86204", "86219", "86222", "86399" (7383 values omitted).

Valor.Sin.IVA

Feature Result
Variable type numeric
Number of missing obs. 213986 (13.11 %)
Number of unique values 439581
Median 326408.65
1st and 3rd quartiles 36100; 2236405.88
Min. and max. -443849906571097; 443849906571097

  • Note that the following possible outlier values were detected: "-443849906571097", "-329279023694080", "-17446804689232.4", "-11555412161049.8", "-8681541547378.88", …, "17447386501483.4", "22792811637665", "258896134619401", "329279115762827", "443849906571097" (50932 values omitted).
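The missing-value percentages for this table follow from dividing each missing count by the 1632800 observations and rounding to 2 decimals, which is why 38 missing SkIdCapitulo values and 35 missing Cantidad values are displayed as 0 %. A minimal Python sketch of that arithmetic, using the counts printed in the variable list above:

```python
n_obs = 1_632_800  # Number of observations reported for ADP_DTM_FACT.ControlProyecto.

# Missing counts copied from the variable list.
missing = {
    "SkIdCapitulo": 38,
    "SkIdItems": 13438,
    "Cantidad": 35,
    "Origen.Documento": 1653,
    "Valor.Sin.IVA": 213986,
}

# Percentage of missing observations, rounded to 2 decimals as in the report.
pct = {name: round(100 * count / n_obs, 2) for name, count in missing.items()}

for name, value in pct.items():
    print(f"{name}: {missing[name]} missing = {value} %")
```

A displayed 0 % therefore does not guarantee a complete column; the raw count is the safer figure to check.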

Report generation information:

  • Report creation time: Sun Nov 02 2025 14:32:38



ADP_DTM_FACT.Devoluciones

The dataset examined has the following dimensions:

Feature Result
Number of observations 1008
Number of variables 13

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdEmpresa integer 1 0.00 % ×
SkIdProyecto integer 56 0.00 % ×
SkIdTercero integer 107 0.00 %
SkIdInsumo integer 569 0.00 % ×
SkIdFecha integer 377 0.00 %
SkIdEstado integer 4 0.00 % ×
SkIdBodega integer 54 0.00 % ×
Devolucion.Numero integer 549 0.00 % ×
Remision character 331 0.00 % ×
Total numeric 847 0.00 % ×
Devolucion.Factura character 175 0.00 % ×
Cantidad.Devuelta integer 181 0.00 % ×
Compra.No integer 415 0.00 % ×

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdProyecto

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 56
Median 100210
1st and 3rd quartiles 100167; 100226
Min. and max. 1005; 100277

  • Note that the following possible outlier values were detected: "1005", "1009", "10011", "10012", "10013", …, "100264", "100268", "100269", "100275", "100277" (14 values omitted).

SkIdTercero

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 107
Median 5163
1st and 3rd quartiles 2095; 8603
Min. and max. 1919; 12678


SkIdInsumo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 569
Median 1005183.5
1st and 3rd quartiles 1002094.75; 10010618
Min. and max. 100114; 10018213

  • Note that the following possible outlier values were detected: "100114", "100115", "100140", "100141", "100143", …, "100901", "100904", "100906", "100918", "100961" (49 values omitted).

SkIdFecha

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 377
Median 20220616
1st and 3rd quartiles 20191128; 20240327
Min. and max. 20130812; 20251029


SkIdEstado

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 4
Mode “10053”
Reference category 10050

  • Note that the following levels have at most five observations: "10051".
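The "at most five observations" check amounts to counting how often each level occurs and listing the levels at or below the threshold. A minimal Python sketch of that idea follows. The level codes 10050, 10051 and 10053 come from the report, but the frequencies below are invented for illustration, and `rare_levels` is a hypothetical helper rather than the report tool's own function.

```python
from collections import Counter


def rare_levels(values, max_count=5):
    """Return the levels that occur at most max_count times."""
    counts = Counter(values)
    return sorted(level for level, n in counts.items() if n <= max_count)


# Toy data: the level codes appear in the report, the frequencies are made up.
sk_id_estado = [10053] * 10 + [10050] * 7 + [10051] * 2
rare = rare_levels(sk_id_estado)

print(rare)
```

For a key column such as SkIdEstado, a rare level is not necessarily an error; it may simply be a state that few documents ever reach.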

SkIdBodega

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 54
Median 1000204
1st and 3rd quartiles 1000163; 1000225
Min. and max. 100; 1000277

  • Note that the following possible outlier values were detected: "100", "10005", "10009", "100011", "100012", …, "1000264", "1000268", "1000269", "1000275", "1000277" (13 values omitted).

Devolucion.Numero

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 549
Median 20400012.5
1st and 3rd quartiles 1840007.75; 22500001
Min. and max. 50014; 27700001

  • Note that the following possible outlier values were detected: "24000001", "24000002", "24000003", "24000004", "24000005", …, "27500001", "27500002", "27500003", "27500004", "27700001" (25 values omitted).

Remision

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 331
Mode “”

  • The following suspected missing value codes enter as regular values: "".

  • Note that the following levels have at most five observations: "-00439194", "00000000000965", "000288406", "001", "001-2", …, "RM13015118", "RM13015812", "rm48-694", "Vales: 178, 187, 188, 177 - Arena y grava para cic", "xxxx" (312 values omitted).
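Since the empty string is the most frequent Remision value and is flagged as a suspected missing-value code, it could be converted to an explicit missing marker before analysis. A minimal Python sketch follows. Treating "" as missing follows the report; treating "xxxx" as a placeholder too is only an assumption and is shown as an optional setting. The helper name `normalize_missing` is hypothetical.

```python
def normalize_missing(value, placeholders=("",)):
    """Return None for placeholder values, otherwise the stripped text."""
    stripped = value.strip()
    return None if stripped in placeholders else stripped


# Sample values copied from the report.
remision = ["", "001", "RM13015118", "xxxx"]

default_clean = [normalize_missing(v) for v in remision]
# Optional, assumption-based variant that also treats "xxxx" as missing.
strict_clean = [normalize_missing(v, placeholders=("", "xxxx")) for v in remision]

print(default_clean)
print(strict_clean)
```

Keeping the placeholder list as a parameter makes the decision about values such as "xxxx" explicit and easy to reverse.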


Total

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 847
Median 277830
1st and 3rd quartiles 50654.38; 1586548.91
Min. and max. 0; 111091518.6

  • Note that the following possible outlier values were detected: "22450857.97", "22694014", "24836112.31", "26798436", "28010291.4", …, "31939598.7", "32609968.65", "33304403.46", "37292701.95", "111091518.6" (3 values omitted).

Devolucion.Factura

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 175
Mode “”

  • The following suspected missing value codes enter as regular values: "".

  • Note that the following levels have at most five observations: "000232", "001", "001-00000047880", "0036", "02", …, "NCSK990010781", "NCV23", "NCV24", "SALIDA POR AJUSTE", "x" (157 values omitted).


Cantidad.Devuelta

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 181
Median 10
1st and 3rd quartiles 2; 55
Min. and max. 0; 20000

  • Note that the following possible outlier values were detected: "908", "920", "950", "994", "1009", …, "9407", "10000", "11208", "14322", "20000" (37 values omitted).

Compra.No

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 415
Median 20400226
1st and 3rd quartiles 15700390.75; 22500006
Min. and max. 51474; 27700013

  • Note that the following possible outlier values were detected: "23700022", "23700035", "23900060", "23900067", "23900089", …, "26900191", "27500061", "27500062", "27500116", "27700013" (23 values omitted).

Report generation information:

  • Report creation time: Sun Nov 02 2025 14:32:42



ADP_DTM_FACT.EjecucionCliente

The dataset examined has the following dimensions:

Feature Result
Number of observations 905
Number of variables 12

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdEmpresa integer 1 0.00 % ×
SkIdProyecto integer 4 0.00 % ×
SkIdFecha integer 19 0.00 % ×
SkIdItems integer 238 0.00 % ×
SkIdCapitulo integer 13 0.00 % ×
SkIdEstado integer 1 0.00 % ×
SkIdEspecificacionEjecucionCliente numeric 28 0.00 % ×
Valor.Garantia numeric 17 0.00 %
Valor.Amortizacion numeric 8 0.00 %
Cantidad.Ejecucion.Cliente numeric 592 0.00 % ×
Valor.Unitario numeric 185 0.00 % ×
Valor.Total numeric 737 0.00 % ×

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdProyecto

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 4
Mode “100188”
Reference category 1006

  • Note that the following levels have at most five observations: "100129", "1006".

SkIdFecha

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 19
Median 20200519
1st and 3rd quartiles 20191105; 20201013
Min. and max. 20120216; 20250113

  • Note that the following possible outlier values were detected: "20120216", "20160224", "20241211", "20250113".

SkIdItems

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 238
Median 10061380
1st and 3rd quartiles 10061316; 10064802
Min. and max. 1006736; 100113091

  • Note that the following possible outlier values were detected: "1006736", "1007113", "10031744", "100112911", "100112912", …, "100112923", "100112931", "100113085", "100113086", "100113091" (6 values omitted).

SkIdCapitulo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 13
Median 1001882511
1st and 3rd quartiles 1001882510; 1001882512
Min. and max. 1006157; 1002753954

  • Note that the following possible outlier values were detected: "1006157", "1001291385", "1001882521", "1001882524", "1001882525", "1001882527", "1002753939", "1002753940", "1002753954".

SkIdEstado

  • The variable only takes one (non-missing) value: "10080". The variable contains 0 % missing observations.

SkIdEspecificacionEjecucionCliente

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 28
Median 10018813
1st and 3rd quartiles 1001887; 10018818
Min. and max. 10061; 10027527500003

  • Note that the following possible outlier values were detected: "10027527500001", "10027527500002", "10027527500003".

Valor.Garantia

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 17
Median 51796257.1
1st and 3rd quartiles 0; 158295413.8
Min. and max. 0; 197374360.4


Valor.Amortizacion

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 8
Median 0
1st and 3rd quartiles 0; 76392772
Min. and max. 0; 354584821


Cantidad.Ejecucion.Cliente

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 592
Median 45.73
1st and 3rd quartiles 2; 383.68
Min. and max. -7001.4; 141305.29

  • Note that the following possible outlier values were detected: "-7001.4", "-5054.4", "-3229.29", "-1084.58", "-980.31", …, "70652.64", "73332.01", "81372.5", "88884.16", "141305.29" (72 values omitted).

Valor.Unitario

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 185
Median 75900
1st and 3rd quartiles 8100; 1041389
Min. and max. 0; 1.4e+09

  • Note that the following possible outlier values were detected: "27865000", "3e+07", "34272000", "4.5e+07", "50674048", …, "81866586", "124564154", "183851314", "193499918", "1.4e+09" (2 values omitted).

Valor.Total

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 737
Median 5862311.13
1st and 3rd quartiles 1512001.89; 24857044.48
Min. and max. -188593765.29; 1.4e+09

  • Note that the following possible outlier values were detected: "-188593765.29", "-47078364.06", "-33628554.24", "-24151954", "-21571313.4", …, "589950631.53", "644410139.7", "1024463338", "1231250573.15", "1.4e+09" (26 values omitted).

Report generation information:

  • Report creation time: Sun Nov 02 2025 14:32:47



ADP_DTM_FACT.EjecucionEstandar

The dataset examined has the following dimensions:

Feature Result
Number of observations 2712
Number of variables 11

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdEmpresa integer 1 0.00 % ×
SkIdProyecto integer 10 0.00 %
SkIdItems integer 1235 0.00 % ×
SkIdCapitulo integer 94 0.00 %
SkIdFecha integer 51 0.00 % ×
SkIdEstado integer 1 0.00 % ×
Numero.Ejecucion integer 48 0.00 % ×
Cantidad.Ejecucion numeric 1317 0.00 % ×
Valor.Unitario numeric 965 0.00 % ×
Valor.Unitario.Presupuesto numeric 900 0.00 % ×
Valor.Total numeric 1943 0.00 % ×

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdProyecto

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 10
Median 100188
1st and 3rd quartiles 10028; 100188
Min. and max. 1003; 100275


SkIdItems

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 1235
Median 10061361
1st and 3rd quartiles 10015697.75; 10062381
Min. and max. 1002462; 100124924

  • Note that the following possible outlier values were detected: "1002462", "1002463", "1002499", "1002503", "1002504", …, "100124920", "100124921", "100124922", "100124923", "100124924" (553 values omitted).

SkIdCapitulo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 94
Median 1001882510
1st and 3rd quartiles 10028579; 1001882521
Min. and max. 100346; 1002753954


SkIdFecha

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 51
Median 20191126
1st and 3rd quartiles 20131203; 20200908
Min. and max. 20111021; 20250128

  • Note that the following possible outlier values were detected: "20220106", "20240830", "20240930", "20241127", "20241129", "20250113", "20250128".

SkIdEstado

  • The variable only takes one (non-missing) value: "10090". The variable contains 0 % missing observations.

Numero.Ejecucion

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 48
Median 14
1st and 3rd quartiles 3; 31
Min. and max. 1; 27500007

  • Note that the following possible outlier values were detected: "21100001", "21100002", "27500001", "27500002", "27500003", "27500004", "27500005", "27500006", "27500007".
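Numero.Ejecucion has a median of 14 and quartiles of 3 and 31, yet its flagged values are in the tens of millions, which suggests two different numbering schemes in one column. To illustrate why such values stand out, here is a minimal Python sketch of the classic Tukey fence rule applied to the quartiles printed above. This is a substitute rule chosen for illustration; the report tool may use a different, possibly skewness-adjusted, criterion. The helper name `tukey_fences` is hypothetical.

```python
def tukey_fences(q1, q3, k=1.5):
    """Return (lower, upper) fences at k interquartile ranges beyond the quartiles."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# Quartiles of Numero.Ejecucion as printed in the report.
low, high = tukey_fences(3, 31)

# Minimum, median, third quartile and two flagged values from the report.
sample = [1, 14, 31, 21100001, 27500007]
flagged = [v for v in sample if v < low or v > high]

print(low, high)
print(flagged)
```

With fences at -39 and 73, every value of the 21100001 and 27500001 series falls far outside, so these rows are better handled as a separate numbering scheme than as numeric outliers.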

Cantidad.Ejecucion

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 1317
Median 8
1st and 3rd quartiles 1; 157.32
Min. and max. -32013; 272810.2

  • Note that the following possible outlier values were detected: "-32013", "-7001.4", "-5054.4", "-3229.29", "-1530", …, "88884.16", "116906", "134391.73", "141305.29", "272810.2" (178 values omitted).

Valor.Unitario

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 965
Median 159399.95
1st and 3rd quartiles 26855.12; 1394000
Min. and max. 0; 1.4e+09

  • Note that the following possible outlier values were detected: "27865000", "29901856.51", "3e+07", "30000023.25", "3.1e+07", …, "2e+08", "210765125", "254375820", "1.03e+09", "1.4e+09" (36 values omitted).

Valor.Unitario.Presupuesto

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 900
Median 174239.44
1st and 3rd quartiles 27999.99; 1461403
Min. and max. 0; 1317647059

  • Note that the following possible outlier values were detected: "3e+07", "30000023.26", "3.1e+07", "33500000", "34150427.96", …, "193499918", "2e+08", "254375820", "1.03e+09", "1317647059" (28 values omitted).

Valor.Total

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 1943
Median 4110720
1st and 3rd quartiles 685120; 14916988
Min. and max. -188593765.29; 1070372500000

  • Note that the following possible outlier values were detected: "-188593765.29", "-98637946.59", "-73574640", "-47078364.06", "-39820789.5", …, "30856474267.74", "51610166991.88", "226878600000", "344872700000", "1070372500000" (125 values omitted).

Report generation information:

  • Report creation time: Sun Nov 02 2025 14:32:53



ADP_DTM_FACT.EntradasAlmacen

The dataset examined has the following dimensions:

Feature Result
Number of observations 120665
Number of variables 16

Checks performed

The following variable checks were performed, depending on the data type of each variable:

  character factor labelled haven labelled numeric integer logical Date
Identify miscoded missing values × × × × × × ×
Identify prefixed and suffixed whitespace × × × ×
Identify levels with < 6 obs. × × × ×
Identify case issues × × × ×
Identify misclassified numeric or integer variables × × × ×
Identify outliers × × ×

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdEmpresa integer 1 0.00 % ×
SkIdProyecto integer 68 0.00 %
SkIdFechaCompra integer 3483 0.00 %
SkIdFechaEntrada integer 3895 0.00 %
SkIdTercero integer 750 0.00 % ×
SkIdInsumo integer 6858 0.00 % ×
SkIdBodega integer 68 0.00 %
SkIdEstadoPorDocumento integer 5 0.00 % ×
SkIdEspecificacionEntradasAlmacen numeric 57476 0.00 %
Total.Entrada numeric 58481 0.00 % ×
Compra.Numero integer 23185 0.00 % ×
Compra.Total.Pagar numeric 19682 0.00 % ×
Entrada.Valor.Iva numeric 54069 0.00 % ×
Entrada.Valor.Sin.Iva numeric 53998 0.00 % ×
Entrada.Cantidad numeric 12348 0.00 % ×
Entrada.Valor.Amortizado numeric 8057 0.00 % ×

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdProyecto

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 68
Median 100157
1st and 3rd quartiles 10035; 100217
Min. and max. 1003; 100294


SkIdFechaCompra

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 3483
Median 20190517
1st and 3rd quartiles 20150119; 20230629
Min. and max. 20110616; 20251030


SkIdFechaEntrada

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 3895
Median 20190628
1st and 3rd quartiles 20150217; 20230817
Min. and max. 20110630; 20251031


SkIdTercero

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 750
Median 5485
1st and 3rd quartiles 4553; 8489
Min. and max. 71; 12696

  • Note that the following possible outlier values were detected: "71", "85", "98", "175", "188", …, "2534", "2536", "2648", "2666", "2696" (120 values omitted).

SkIdInsumo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 6858
Median 1003692
1st and 3rd quartiles 1001775; 1008226
Min. and max. 100101; 10019275

  • Note that the following possible outlier values were detected: "100101", "100106", "100107", "100108", "100109", …, "10019209", "10019210", "10019218", "10019220", "10019275" (3403 values omitted).
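The SkId prefix suggests these columns are surrogate keys (an assumption based on the naming only). Outlier flags on identifiers say little about data quality, so one option is to separate key columns from measures before running numeric checks. A minimal sketch using the column names from the summary table above; treating Compra.Numero as an identifier is also an assumption, and split_columns is a hypothetical helper.

```python
# Column names copied from the ADP_DTM_FACT.EntradasAlmacen summary table.
COLUMNS = [
    "SkIdEmpresa", "SkIdProyecto", "SkIdFechaCompra", "SkIdFechaEntrada",
    "SkIdTercero", "SkIdInsumo", "SkIdBodega", "SkIdEstadoPorDocumento",
    "SkIdEspecificacionEntradasAlmacen", "Total.Entrada", "Compra.Numero",
    "Compra.Total.Pagar", "Entrada.Valor.Iva", "Entrada.Valor.Sin.Iva",
    "Entrada.Cantidad", "Entrada.Valor.Amortizado",
]

# Assumption: document numbers behave like identifiers, not quantities.
ID_LIKE = {"Compra.Numero"}


def split_columns(columns, key_prefix="SkId", extra_ids=()):
    """Split column names into (keys, measures) by naming convention."""
    keys = [c for c in columns if c.startswith(key_prefix) or c in extra_ids]
    measures = [c for c in columns if c not in keys]
    return keys, measures


keys, measures = split_columns(COLUMNS, extra_ids=ID_LIKE)
print(measures)
```

Only the columns in measures would then be passed to an outlier check; the keys could be profiled for uniqueness and referential integrity instead.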

SkIdBodega

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 68
Median 1000157
1st and 3rd quartiles 100035; 1000217
Min. and max. 10003; 1000294


SkIdEstadoPorDocumento

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 5
Mode “100134”
Reference category 100130

  • Note that the following levels have at most five observations: "100131".

SkIdEspecificacionEntradasAlmacen

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 57476
Median 1001571570761
1st and 3rd quartiles 10035350079; 10021721700238
Min. and max. 100330001; 10029429400005


Total.Entrada

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 58481
Median 588874
1st and 3rd quartiles 79642; 3036581.67
Min. and max. 0; 3057433903.72

  • Note that the following possible outlier values were detected: "38850000", "38853024", "38881919.36", "38887474.5", "38898034.56", …, "285715830.1", "415068501.99", "909115277.42", "1294105349.53", "3057433903.72" (1086 values omitted).

Compra.Numero

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 23185
Median 15700060
1st and 3rd quartiles 350599; 21700041
Min. and max. 30001; 29400001

  • Note that the following possible outlier values were detected: "27800001", "27800003", "27800004", "27800005", "27800006", …, "28300001", "28300002", "28300004", "28300005", "29400001" (11 values omitted).

Compra.Total.Pagar

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 19682
Median 9737910
1st and 3rd quartiles 1276000; 89250000
Min. and max. -20230; 8922395811.7

  • Note that the following possible outlier values were detected: "1766205396.18", "1812373808", "1951600000", "2021780415.95", "2809854998.73", "3057433903.72", "3909008207.71", "3947827719.8", "3996857167.4", "8922395811.7".

Entrada.Valor.Iva

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 54069
Median 60000
1st and 3rd quartiles 6650; 424870.4
Min. and max. 0; 488161715.72

  • Note that the following possible outlier values were detected: "6881830.4", "6883548", "6891548.75", "6891875.8", "6904854.41", …, "45618493.88", "57250827.86", "145152859.42", "206621862.53", "488161715.72" (851 values omitted).

Entrada.Valor.Sin.Iva

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 53998
Median 504400
1st and 3rd quartiles 68100; 2589202
Min. and max. 0; 2569272188

  • Note that the following possible outlier values were detected: "33148890", "33156250", "33160050", "33189625", "33213188.75", …, "266874835", "357817674.13", "763962418", "1087483487", "2569272188" (1069 values omitted).
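The three largest values listed for Entrada.Valor.Sin.Iva, Entrada.Valor.Iva and Total.Entrada line up arithmetically, which hints that the extreme totals are internally consistent rather than entry errors. The sketch below checks two relations: net + IVA = total, and IVA = 19 % of net. Both the pairing of the values (they are matched by rank, not by row) and the 19 % rate are assumptions; the report itself states neither.

```python
from decimal import Decimal

# (Entrada.Valor.Sin.Iva, Entrada.Valor.Iva, Total.Entrada), matched by rank
# from the three outlier listings above. Row-level pairing is an assumption.
TRIPLES = [
    ("2569272188", "488161715.72", "3057433903.72"),
    ("1087483487", "206621862.53", "1294105349.53"),
    ("763962418", "145152859.42", "909115277.42"),
]

ASSUMED_RATE = Decimal("0.19")  # assumption: not stated anywhere in the report


def check_triple(net, iva, total, rate=ASSUMED_RATE):
    """Return whether the sum and the tax-rate relation hold exactly."""
    net, iva, total = Decimal(net), Decimal(iva), Decimal(total)
    expected_iva = (net * rate).quantize(Decimal("0.01"))
    return {"sum_ok": net + iva == total, "rate_ok": expected_iva == iva}


results = [check_triple(*t) for t in TRIPLES]
for triple, result in zip(TRIPLES, results):
    print(triple, result)
```

Decimal is used instead of float so that the comparison of values with two decimals is exact.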

Entrada.Cantidad

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 12348
Median 12
1st and 3rd quartiles 4; 100
Min. and max. 0; 95943

  • Note that the following possible outlier values were detected: "2079.76", "2080", "2081", "2081.2", "2082.4", …, "72400", "78400", "80024", "82000", "95943" (4157 values omitted).

Entrada.Valor.Amortizado

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 8057
Median 0
1st and 3rd quartiles 0; 0
Min. and max. 0; 826031795

  • Note that the following possible outlier values were detected: "0", "0.06", "0.08", "0.66", "203", …, "236391056", "267004992", "289014484", "619484697", "826031795" (8046 values omitted).

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:33:02

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_FACT.InventarioResumido

The dataset examined has the following dimensions:

Feature Result
Number of observations 335598
Number of variables 13

Checks performed

The following variable checks were performed, depending on the data type of each variable:

  character factor labelled haven labelled numeric integer logical Date
Identify miscoded missing values × × × × × × ×
Identify prefixed and suffixed whitespace × × × ×
Identify levels with < 6 obs. × × × ×
Identify case issues × × × ×
Identify misclassified numeric or integer variables × × × ×
Identify outliers × × ×

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdEmpresa integer 1 0.00 % ×
Empresa character 1 0.00 % ×
SkIdFecha integer 4203 0.00 %
SkIdProyecto integer 71 0.00 %
SkIdInsumo integer 7382 0.00 % ×
Tipo character 9 0.00 %
Documento integer 69753 0.00 %
Bodega integer 1 0.00 % ×
Cantidad numeric 30147 0.00 % ×
Unitario.Neto numeric 53758 0.00 % ×
Valor.Iva numeric 58609 0.00 % ×
Unitario numeric 53758 0.00 % ×
Total numeric 155049 0.00 % ×

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

Empresa

  • The variable only takes one (non-missing) value: "ARPRO ARQUITECTOS INGENIEROS S.A.S". The variable contains 0 % missing observations.

SkIdFecha

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 4203
Median 20190226
1st and 3rd quartiles 20150417; 20230529
Min. and max. 20110630; 20251031


SkIdProyecto

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 71
Median 100167
1st and 3rd quartiles 10035; 100226
Min. and max. 1003; 100294


SkIdInsumo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 7382
Median 1003966
1st and 3rd quartiles 1001614; 1008512
Min. and max. 100101; 10019275

  • Note that the following possible outlier values were detected: "100101", "100106", "100107", "100108", "100109", …, "10019209", "10019210", "10019218", "10019220", "10019275" (3805 values omitted).

Tipo

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 9
Mode “SA”


Documento

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 69753
Median 1880865
1st and 3rd quartiles 310563.25; 21000541
Min. and max. -10; 174000235


Bodega

  • The variable only takes one (non-missing) value: "0". The variable contains 0 % missing observations.

Cantidad

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 30147
Median -1
1st and 3rd quartiles -10; 7
Min. and max. -743487.2; 272825

  • Note that the following possible outlier values were detected: "-743487.2", "-422650", "-394073.2", "-390811.69", "-344627", …, "95943", "97806", "136661.9", "175019", "272825" (25290 values omitted).

Unitario.Neto

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 53758
Median 10115
1st and 3rd quartiles 2813.61; 48000
Min. and max. -1160; 1087483487

  • Note that the following possible outlier values were detected: "-1160", "700791", "700910", "700969.5", "701564.5", …, "688343461.35", "688343461.35", "688343461.35", "763962418", "1087483487" (3235 values omitted).

Valor.Iva

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 58609
Median 0
1st and 3rd quartiles 0; 17920
Min. and max. 0; 488161715.72

  • Note that the following possible outlier values were detected: "557878.8", "557961.6", "557973", "558077.5", "558083.3", …, "57250827.86", "134083239.21", "145152859.42", "206621862.53", "488161715.72" (17093 values omitted).

Unitario

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 53758
Median 10115
1st and 3rd quartiles 2813.61; 48000
Min. and max. -1160; 1087483487

  • Note that the following possible outlier values were detected: "-1160", "700791", "700910", "700969.5", "701564.5", …, "688343461.35", "688343461.35", "688343461.35", "763962418", "1087483487" (3235 values omitted).

Total

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 155049
Median -7638.18
1st and 3rd quartiles -170836.4; 220000
Min. and max. -3627960661.17; 3057433903.72

  • Note that the following possible outlier values were detected: "-3627960661.17", "-1446488216", "-1317428982.35", "-1286132238.23", "-1278192899.21", …, "845139203.01", "909115277.42", "1294105349.53", "1331291738.96", "3057433903.72" (73184 values omitted).

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:33:17

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_FACT.NotasEnValor

The dataset examined has the following dimensions:

Feature Result
Number of observations 220
Number of variables 9

Checks performed

The following variable checks were performed, depending on the data type of each variable:

  character factor labelled haven labelled numeric integer logical Date
Identify miscoded missing values × × × × × × ×
Identify prefixed and suffixed whitespace × × × ×
Identify levels with < 6 obs. × × × ×
Identify case issues × × × ×
Identify misclassified numeric or integer variables × × × ×
Identify outliers × × ×

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdEmpresa integer 1 0.00 % ×
SkIdTercero integer 33 0.00 % ×
SkIdProyecto integer 36 0.00 % ×
SkIdFecha integer 110 0.00 % ×
SkIdInsumo integer 75 0.00 % ×
SkIdEstado integer 3 0.00 % ×
Nota.Numero integer 143 0.00 %
Total.devolucion numeric 179 0.00 % ×
Empresa character 1 0.00 % ×

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdTercero

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 33
Median 4814
1st and 3rd quartiles 4814; 8489
Min. and max. 1920; 12296

  • Note that the following possible outlier values were detected: "1920", "2034", "2306", "2666", "3842", "4463", "4605".

SkIdProyecto

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 36
Median 100135
1st and 3rd quartiles 100118; 100228
Min. and max. 1003; 100275

  • Note that the following possible outlier values were detected: "1003", "1009", "10011", "10012", "10013", "10028", "10031", "10035".

SkIdFecha

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 110
Median 20180757
1st and 3rd quartiles 20171019; 20200109
Min. and max. 20130401; 20251009

  • Note that the following possible outlier values were detected: "20130401", "20130604", "20131025", "20140310", "20140327", …, "20160727", "20160802", "20160808", "20161104", "20161214" (32 values omitted).
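SkIdFecha stores dates as yyyymmdd integers, so the summary statistics above are computed on the integer codes rather than on calendar dates. The printed median, 20180757, is not a valid date (there is no day 57), which is what happens when two codes on either side of a month boundary are averaged. A minimal sketch of parsing the keys before summarising; the two example keys are hypothetical values chosen only because their integer average reproduces the printed median.

```python
from datetime import date
from statistics import median


def parse_key(key):
    """Parse a yyyymmdd integer; return None if it is not a calendar date."""
    year, month, day = key // 10000, key // 100 % 100, key % 100
    try:
        return date(year, month, day)
    except ValueError:
        return None


def median_date(keys):
    """Median computed on real dates (day ordinals), not on integer codes."""
    ordinals = [parse_key(k).toordinal() for k in keys]
    return date.fromordinal(int(median(ordinals)))


print(parse_key(20180757))  # the median printed in the table above
print(parse_key(20171019))  # the first quartile printed in the table above

# Hypothetical pair of keys; their integer mean is 20180757.
example = [20180713, 20180801]
print(sum(example) // 2, median_date(example))
```

The same caveat applies to the outlier list for this variable: distances between yyyymmdd codes jump at every month and year boundary, so flags on the raw integers should be read with care.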

SkIdInsumo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 75
Median 1003477
1st and 3rd quartiles 1001983; 10012089.5
Min. and max. 100143; 10018010

  • Note that the following possible outlier values were detected: "100143", "100146", "100147", "100149", "100151", "100297", "100805", "100884".

SkIdEstado

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 3
Mode “10043”
Reference category 10040

  • Note that the following levels have at most five observations: "10040", "10042".

Nota.Numero

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 143
Median 120.5
1st and 3rd quartiles 89.75; 16700003.25
Min. and max. 3; 27500002


Total.devolucion

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 179
Median -922822.54
1st and 3rd quartiles -2402847; -205490.75
Min. and max. -36534714; 1118525

  • Note that the following possible outlier values were detected: "-36534714", "-23240432", "-18344203", "-16652565", "950171", "1118525".

Empresa

  • The variable only takes one (non-missing) value: "ARPRO ARQUITECTOS INGENIEROS S.A.S". The variable contains 0 % missing observations.

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:33:19

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_FACT.Pedidos

The dataset examined has the following dimensions:

Feature Result
Number of observations 100732
Number of variables 10

Checks performed

The following variable checks were performed, depending on the data type of each variable:

  character factor labelled haven labelled numeric integer logical Date
Identify miscoded missing values × × × × × × ×
Identify prefixed and suffixed whitespace × × × ×
Identify levels with < 6 obs. × × × ×
Identify case issues × × × ×
Identify misclassified numeric or integer variables × × × ×
Identify outliers × × ×

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdEmpresa integer 1 0.00 % ×
SkIdProyecto integer 59 0.00 % ×
SkIdCapitulo integer 597 0.00 % ×
SkIdPedido integer 55504 0.00 % ×
SkIdItems integer 7313 0.00 % ×
SkIdInsumo integer 5581 0.00 % ×
SkIdFechaPedido integer 1866 0.00 % ×
SkIdFechaRequerido integer 2043 0.00 % ×
SkIdEstado integer 7 0.00 % ×
Cantidad numeric 18224 0.00 % ×

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdProyecto

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 59
Median 100217
1st and 3rd quartiles 100211; 100239
Min. and max. 1006; 100295

  • Note that the following possible outlier values were detected: "1006", "1009", "10021", "10029", "10030", …, "100171", "100173", "100174", "100184", "100188" (16 values omitted).

SkIdCapitulo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 597
Median 1002173804
1st and 3rd quartiles 1002112832; 1002393392
Min. and max. 1006166; 1002954538

  • Note that the following possible outlier values were detected: "1006166", "1009342", "10021454", "10030986", "100291591", …, "1002042739", "1002042740", "1002042741", "1002042902", "1002042937" (229 values omitted).

SkIdPedido

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 55504
Median 100118050
1st and 3rd quartiles 100103631.75; 100132065
Min. and max. 10019331; 100141328

  • Note that the following possible outlier values were detected: "10019331", "10026009", "10027787", "10027788", "10027789", …, "10099995", "10099996", "10099997", "10099998", "10099999" (16729 values omitted).

SkIdItems

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 7313
Median 10097995
1st and 3rd quartiles 10079534; 100108006
Min. and max. 1006814; 100144963

  • Note that the following possible outlier values were detected: "1006814", "1009753".

SkIdInsumo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 5581
Median 1003189
1st and 3rd quartiles 1001985; 10010498
Min. and max. 100101; 10019323

  • Note that the following possible outlier values were detected: "100101", "100106", "100107", "100108", "100109", …, "100992", "100994", "100997", "100998", "100999" (363 values omitted).

SkIdFechaPedido

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 1866
Median 20240220
1st and 3rd quartiles 20230214; 20240511
Min. and max. 20131024; 20251031

  • Note that the following possible outlier values were detected: "20240902", "20240903", "20240904", "20240905", "20240906", …, "20251027", "20251028", "20251029", "20251030", "20251031" (320 values omitted).

SkIdFechaRequerido

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 2043
Median 20240221
1st and 3rd quartiles 20230214; 20240514
Min. and max. 19000101; 20430621

  • Note that the following possible outlier values were detected: "19000101", "20240905", "20240906", "20240907", "20240909", …, "20321117", "20360615", "20360709", "20400316", "20430621" (362 values omitted).
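The range of SkIdFechaRequerido runs from 19000101 to 20430621. A key of 19000101 often serves as a placeholder for an unknown date, and required-by dates many years ahead may be typing errors; both readings are assumptions, not something the report confirms. The sketch below separates these cases so they can be reviewed apart from ordinary dates. The five-year horizon and the classify helper are hypothetical choices.

```python
from datetime import date

REPORT_DATE = date(2025, 11, 2)  # taken from the report creation time
PLACEHOLDER_KEY = 19000101       # assumption: used as an "unknown date" code


def classify(key, horizon_years=5):
    """Label a yyyymmdd key as placeholder, far_future or plausible."""
    if key == PLACEHOLDER_KEY:
        return "placeholder"
    parsed = date(key // 10000, key // 100 % 100, key % 100)
    if parsed.year > REPORT_DATE.year + horizon_years:
        return "far_future"
    return "plausible"


# Keys copied from the outlier listing above.
listed = [19000101, 20240905, 20321117, 20360615, 20430621]
labels = {key: classify(key) for key in listed}
for key, label in labels.items():
    print(key, label)
```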

SkIdEstado

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 7
Median 10073
1st and 3rd quartiles 10073; 10073
Min. and max. -10075; 10073

  • Note that the following possible outlier values were detected: "-10075", "-10072", "-10071", "10070", "10071", "10072".

Cantidad

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 18224
Median 6
1st and 3rd quartiles 1; 50
Min. and max. 0; 1821837.73

  • Note that the following possible outlier values were detected: "954.34", "955", "956.69", "956.84", "956.86", …, "819643.14", "860521", "1735293.85", "1768184.61", "1821837.73" (3691 values omitted).

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:33:25

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_FACT.ProductosProveedorZona

The dataset examined has the following dimensions:

Feature Result
Number of observations 0
Number of variables 12

Checks performed

The following variable checks were performed, depending on the data type of each variable:

  character factor labelled haven labelled numeric integer logical Date
Identify miscoded missing values × × × × × × ×
Identify prefixed and suffixed whitespace × × × ×
Identify levels with < 6 obs. × × × ×
Identify case issues × × × ×
Identify misclassified numeric or integer variables × × × ×
Identify outliers × × ×

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdEmpresa logical 0 NaN % ×
SkIdZona logical 0 NaN % ×
SkIdInsumo logical 0 NaN % ×
SkIdTercero logical 0 NaN % ×
SkIdFechaCotizacion logical 0 NaN % ×
SkIdFechaVigencia logical 0 NaN % ×
ValorSinIVA logical 0 NaN % ×
PorcentajeDescuento logical 0 NaN % ×
IVA logical 0 NaN % ×
CantidadMinima logical 0 NaN % ×
DiasMaxmoParaEntrega logical 0 NaN % ×
ProveedorPrincipal logical 0 NaN % ×
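With 0 observations, every column is read as logical and the missing percentage shows as "NaN %", presumably the result of dividing zero by zero. The per-variable messages for this table are therefore artifacts of the empty input rather than findings. One way to avoid such reports is to skip empty tables before profiling; a minimal sketch with hypothetical helper names, using the observation counts reported so far in this document.

```python
# Observation counts as reported for each table.
ROW_COUNTS = {
    "ADP_DTM_FACT.EntradasAlmacen": 120665,
    "ADP_DTM_FACT.InventarioResumido": 335598,
    "ADP_DTM_FACT.NotasEnValor": 220,
    "ADP_DTM_FACT.Pedidos": 100732,
    "ADP_DTM_FACT.ProductosProveedorZona": 0,
}


def partition_by_rows(row_counts):
    """Split table names into (to_profile, skipped) by row count."""
    to_profile = [name for name, n in row_counts.items() if n > 0]
    skipped = [name for name, n in row_counts.items() if n == 0]
    return to_profile, skipped


def missing_pct(n_missing, n_obs):
    """Missing percentage, or None when there is nothing to divide by."""
    if n_obs == 0:
        return None
    return round(100.0 * n_missing / n_obs, 2)


to_profile, skipped = partition_by_rows(ROW_COUNTS)
print("skipped:", skipped)
print(missing_pct(0, 0), missing_pct(0, 220))
```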

Variable list

SkIdEmpresa

  • The variable is a key (distinct values for each observation).

  • The variable only takes one value: "NA".


SkIdZona

  • The variable is a key (distinct values for each observation).

  • The variable only takes one value: "NA".


SkIdInsumo

  • The variable is a key (distinct values for each observation).

  • The variable only takes one value: "NA".


SkIdTercero

  • The variable is a key (distinct values for each observation).

  • The variable only takes one value: "NA".


SkIdFechaCotizacion

  • The variable is a key (distinct values for each observation).

  • The variable only takes one value: "NA".


SkIdFechaVigencia

  • The variable is a key (distinct values for each observation).

  • The variable only takes one value: "NA".


ValorSinIVA

  • The variable is a key (distinct values for each observation).

  • The variable only takes one value: "NA".


PorcentajeDescuento

  • The variable is a key (distinct values for each observation).

  • The variable only takes one value: "NA".


IVA

  • The variable is a key (distinct values for each observation).

  • The variable only takes one value: "NA".


CantidadMinima

  • The variable is a key (distinct values for each observation).

  • The variable only takes one value: "NA".


DiasMaxmoParaEntrega

  • The variable is a key (distinct values for each observation).

  • The variable only takes one value: "NA".


ProveedorPrincipal

  • The variable is a key (distinct values for each observation).

  • The variable only takes one value: "NA".


Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:33:30

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_FACT.Programacion

The dataset examined has the following dimensions:

Feature Result
Number of observations 421
Number of variables 7

Checks performed

The following variable checks were performed, depending on the data type of each variable:

  character factor labelled haven labelled numeric integer logical Date
Identify miscoded missing values × × × × × × ×
Identify prefixed and suffixed whitespace × × × ×
Identify levels with < 6 obs. × × × ×
Identify case issues × × × ×
Identify misclassified numeric or integer variables × × × ×
Identify outliers × × ×

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdEmpresa integer 1 0.00 % ×
SkIdProyecto integer 1 0.00 % ×
SkIdActividad numeric 419 0.00 % ×
SkIdFechaInicial integer 132 0.00 %
SkIdFechaFinal integer 146 0.00 %
Duracion integer 66 0.00 % ×
PorcentajeAsignado numeric 13 0.00 % ×

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdProyecto

  • The variable only takes one (non-missing) value: "100269". The variable contains 0 % missing observations.

SkIdActividad

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 419
Median 1004355731714487
1st and 3rd quartiles 1002050842307332; 1006724863852852
Min. and max. 1004758460013; 1008861420467332

  • Note that the following possible outlier values were detected: "1004758460013", "10038346358630", "10038443305114", "10044414783636", "10050262658754", …, "100830440010135", "100855018116287", "100863072203575", "100864418640804", "100867322653500" (47 values omitted).
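SkIdActividad is typed as numeric rather than integer, most likely because its values exceed the 32-bit integer range and are held as double-precision floats. Doubles represent every integer exactly only up to 2^53, so it is worth confirming that keys of this size stay below that limit before using them in joins. A small sketch of the check, using the minimum and maximum printed above; the helper name is hypothetical.

```python
MAX_EXACT = 2**53  # every integer up to this magnitude is exact in float64


def exactly_representable(n):
    """True if the integer n survives a round trip through a float."""
    return abs(n) <= MAX_EXACT and int(float(n)) == n


# Minimum and maximum of SkIdActividad from the table above.
for key in (1004758460013, 1008861420467332):
    print(key, exactly_representable(key))

# Counterexample just past the limit.
print(2**53 + 1, exactly_representable(2**53 + 1))
```

Both extremes pass, so the keys in this table can be compared exactly even when stored as doubles; reading them as strings or 64-bit integers would remove the concern entirely.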

SkIdFechaInicial

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 132
Median 20260530
1st and 3rd quartiles 20250617; 20270104
Min. and max. 20241202; 20270626


SkIdFechaFinal

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 146
Median 20260815
1st and 3rd quartiles 20250830; 20270213
Min. and max. 20241202; 20270803


Duracion

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 66
Median 10
1st and 3rd quartiles 4; 53
Min. and max. 0; 766

  • Note that the following possible outlier values were detected: "0".

PorcentajeAsignado

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 13
Median 0
1st and 3rd quartiles 0; 0
Min. and max. 0; 1

  • Note that the following possible outlier values were detected: "0.05", "0.12", "0.14", "0.14", "0.2", …, "0.42", "0.58", "0.61", "0.65", "1" (2 values omitted).

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:33:31

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_FACT.Proyeccion

The dataset examined has the following dimensions:

Feature Result
Number of observations 275143
Number of variables 19

Checks performed

The following variable checks were performed, depending on the data type of each variable:

  character factor labelled haven labelled numeric integer logical Date
Identify miscoded missing values × × × × × × ×
Identify prefixed and suffixed whitespace × × × ×
Identify levels with < 6 obs. × × × ×
Identify case issues × × × ×
Identify misclassified numeric or integer variables × × × ×
Identify outliers × × ×

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdEmpresa integer 1 0.00 % ×
SkIdProyecto integer 77 0.00 %
SkIdCapitulo integer 1389 0.00 %
SkIdItems integer 34007 0.00 % ×
SkIdInsumo integer 11238 0.00 % ×
SkIdReforma logical 1 100.00 % ×
SkIdUsuario integer 27 0.00 % ×
SkIdFecha integer 2785 0.00 % ×
SkIdFecha.Real integer 2803 0.00 %
SkIdEstado integer 3 0.00 %
Cantidad numeric 84100 0.00 % ×
Valor.Unitario numeric 106411 0.00 % ×
Valor.Total numeric 205239 0.00 % ×
Origen character 12 0.00 % ×
Causa integer 16 0.00 %
Cantidad.Item numeric 14599 72.40 % ×
Descripcion.Causa character 16 0.00 % ×
Ajuste.Global integer 1 0.00 % ×
Empresa character 1 0.00 % ×

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdProyecto

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 77
Median 100157
1st and 3rd quartiles 10035; 100225
Min. and max. 1003; 100295


SkIdCapitulo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 1389
Median 1001572385
1st and 3rd quartiles 100291683; 1002253214
Min. and max. 100346; 1002954534


SkIdItems

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 34007
Median 10057210
1st and 3rd quartiles 10026866; 10084837
Min. and max. 1002462; 100145631

  • Note that the following possible outlier values were detected: "1002462", "1002463", "1002464", "1002499", "1002503", …, "100145626", "100145628", "100145629", "100145630", "100145631" (9125 values omitted).

SkIdInsumo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 11238
Median 1005999
1st and 3rd quartiles 1002291; 1008656.5
Min. and max. 100101; 10019349

  • Note that the following possible outlier values were detected: "100101", "100106", "100107", "100108", "100109", …, "10019345", "10019346", "10019347", "10019348", "10019349" (6028 values omitted).

SkIdReforma

  • The variable only takes one value: "NA".

SkIdUsuario

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 27
Median 100140
1st and 3rd quartiles 100140; 100370
Min. and max. 100; 100513

  • Note that the following possible outlier values were detected: "100", "10068", "10069", "10086", "100103", "100128".

SkIdFecha

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 2785
Median 20190521
1st and 3rd quartiles 20151020; 20231011
Min. and max. 19000101; 20251031

  • Note that the following possible outlier values were detected: "19000101".

SkIdFecha.Real

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 2803
Median 20190524
1st and 3rd quartiles 20151025; 20231011
Min. and max. 20110929; 20251031


SkIdEstado

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 3
Mode “100101”
Reference category 100100


Cantidad

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 84100
Median 0
1st and 3rd quartiles -2; 11.58
Min. and max. -8.145313e+13; 8.145324e+13

  • Note that the following possible outlier values were detected: "-8.145313e+13", "-173596289527.98", "-129549469797", "-108497681110.76", "-107957891652.5", …, "107957892710", "108497682173.55", "129549471252", "173596291477.68", "8.145324e+13" (53143 values omitted).

Valor.Unitario

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 106411
Median 11781
1st and 3rd quartiles 968.96; 77350
Min. and max. -159269017478500; 170210911826087

  • Note that the following possible outlier values were detected: "-159269017478500", "-52014597878772.4", "-43068230284084", "-3.6e+13", "-29122346411764.7", …, "2751068759912.96", "3386882117000", "3868403136000", "4786404800000", "170210911826087" (26835 values omitted).

Valor.Total

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 205239
Median 10090.27
1st and 3rd quartiles -308037.4; 805160.85
Min. and max. -521909894612433; 521909894612433

  • Note that the following possible outlier values were detected: "-521909894612433", "-516969750214325", "-493906259178344", "-474733570021391", "-443849906571097", …, "443849906571097", "474733570495248", "493906259178344", "516969755846935", "521909894612433" (72257 values omitted).

Origen

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 12
Mode “”

  • The following suspected missing value codes enter as regular values: "".

  • Note that the following levels have at most five observations: "AMP".


Causa

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 16
Median 18
1st and 3rd quartiles 2; 40
Min. and max. 1; 63


Cantidad.Item

Feature Result
Variable type numeric
Number of missing obs. 199195 (72.4 %)
Number of unique values 14598
Median 1
1st and 3rd quartiles -37.61; 49.2
Min. and max. -399981840.74; 399981840.74

  • Note that the following possible outlier values were detected: "-399981840.74", "-29985524.77", "-6999730", "-4997835.49", "-2000087", …, "2000087", "4997817.17", "6970704.45", "29985524.77", "399981840.74" (6790 values omitted).

Descripcion.Causa

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 16
Mode “Actividad Terminada *”

  • The following values appear with prefixed or suffixed white space: "Actualizacion de precios ", "C. Especificaciones ".

  • Note that the following levels have at most five observations: "C. C. Mano de Obra".


Ajuste.Global

  • The variable only takes one (non-missing) value: "0". The variable contains 0 % missing observations.

Empresa

  • The variable only takes one (non-missing) value: "ARPRO ARQUITECTOS INGENIEROS S.A.S". The variable contains 0 % missing observations.

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:33:44

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_FACT.Reintegro

The dataset examined has the following dimensions:

Feature Result
Number of observations 2872
Number of variables 12

Checks performed

The following variable checks were performed, depending on the data type of each variable:

  character factor labelled haven labelled numeric integer logical Date
Identify miscoded missing values × × × × × × ×
Identify prefixed and suffixed whitespace × × × ×
Identify levels with < 6 obs. × × × ×
Identify case issues × × × ×
Identify misclassified numeric or integer variables × × × ×
Identify outliers × × ×

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdEmpresa integer 1 0.00 % ×
SkIdProyecto integer 35 0.00 % ×
SkIdTercero integer 70 0.00 % ×
SkIdFecha integer 385 0.00 %
SkIdInsumo integer 840 0.00 % ×
SkIdBodega integer 33 0.00 % ×
Numero.Reintegro integer 1147 0.00 % ×
Remision character 115 0.00 % ×
Cantidad numeric 758 0.00 % ×
Valor.Unitario numeric 1381 0.00 % ×
Valor.Total numeric 2297 0.00 % ×
Empresa character 1 0.00 % ×

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdProyecto

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 35
Median 100217
1st and 3rd quartiles 100211; 100226
Min. and max. 10031; 100275

  • Note that the following possible outlier values were detected: "10031", "100135", "100157", "100167", "100170", …, "100188", "100262", "100268", "100269", "100275" (4 values omitted).

SkIdTercero

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 70
Median 11778
1st and 3rd quartiles 11778; 11778
Min. and max. 1377; 12701

  • Note that the following possible outlier values were detected: "1377", "1919", "1920", "2034", "2313", …, "12512", "12593", "12627", "12628", "12701" (59 values omitted).

SkIdFecha

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 385
Median 20230927
1st and 3rd quartiles 20220824; 20241203.25
Min. and max. 20190416; 20251031


SkIdInsumo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 840
Median 1005212
1st and 3rd quartiles 1001985; 10010498
Min. and max. 100101; 10018213

  • Note that the following possible outlier values were detected: "100101", "100114", "100115", "100116", "100140", …, "100985", "100986", "100987", "100997", "100998" (85 values omitted).

SkIdBodega

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 33
Median 1000217
1st and 3rd quartiles 1000204; 1000226
Min. and max. 100; 1000275

  • Note that the following possible outlier values were detected: "100", "100031", "1000135", "1000253", "1000255", "1000256", "1000262", "1000268", "1000269", "1000275".

Numero.Reintegro

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 1147
Median 2170099
1st and 3rd quartiles 2110002; 2260085
Min. and max. 310001; 2750002

  • Note that the following possible outlier values were detected: "310001", "310002", "310003", "310004", "310005", …, "2680007", "2690001", "2690002", "2750001", "2750002" (252 values omitted).

Remision

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 115
Mode “”

  • The following suspected missing value codes enter as regular values: "", "8", "9".

  • The following values appear with prefixed or suffixed white space: " 2620009".

  • Note that the following levels have at most five observations: " 2620009", "0109", "0192", "04", "05", …, "ajuste", "NC -099", "R2930", "R3817", "salida 310" (90 values omitted).
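
The three Remision findings above interact: the empty string is probably a true missing value, while "8" and "9" could be legitimate remission numbers that only resemble missing-value codes. A hedged Python sketch of a cleaning step that strips surrounding whitespace and maps only the empty string to missing; the choice of codes is an assumption to confirm with the data owner, and clean_remision and MISSING_CODES are hypothetical names:

```python
# Assumption: only the empty string means "missing"; "8" and "9" are kept as real values.
MISSING_CODES = {""}


def clean_remision(value):
    """Strip surrounding whitespace and return None for values treated as missing."""
    stripped = value.strip()
    return None if stripped in MISSING_CODES else stripped


# Illustrative values: the first four appear in the report, the last is hypothetical.
sample = ["", " 2620009", "8", "R2930", "   "]
cleaned = [clean_remision(v) for v in sample]
```

Stripping before the missing-value test means a whitespace-only entry is treated as missing as well.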


Cantidad

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 758
Median 10
1st and 3rd quartiles 2; 50.9
Min. and max. 0; 272825

  • Note that the following possible outlier values were detected: "751.3", "752", "753.87", "758", "773.68", …, "85755.8", "94000", "97806", "175019", "272825" (157 values omitted).

Valor.Unitario

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 1381
Median 14109.6
1st and 3rd quartiles 4636.24; 55181.49
Min. and max. 0; 688343461.35

  • Note that the following possible outlier values were detected: "0", "21.48", "152.32", "193.97", "198.73", …, "18367650", "31836070", "89772333.99", "688343461.35", "688343461.35" (58 values omitted).

Valor.Total

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 2297
Median 267571.5
1st and 3rd quartiles 53550; 1171117.42
Min. and max. 0; 1331291738.96

  • Note that the following possible outlier values were detected: "15253977.54", "15465435.16", "15475672.53", "15670017.29", "15952169.08", …, "605811080.33", "647180522.36", "688343461.35", "845139203.01", "1331291738.96" (90 values omitted).
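
If Valor.Total is meant to equal Cantidad times Valor.Unitario, which the report does not state and is assumed here, extreme totals can be separated into consistent rows and arithmetic errors with a row-level check. A small Python sketch with a hypothetical helper name and hypothetical rows:

```python
import math


def total_matches(cantidad, valor_unitario, valor_total, rel_tol=1e-6, abs_tol=0.01):
    """Return True when cantidad * valor_unitario agrees with valor_total within tolerance."""
    expected = cantidad * valor_unitario
    return math.isclose(expected, valor_total, rel_tol=rel_tol, abs_tol=abs_tol)


# Hypothetical rows: (Cantidad, Valor.Unitario, Valor.Total)
rows = [
    (10, 14109.6, 141096.0),  # consistent
    (2, 4636.24, 10000.0),    # inconsistent: 2 * 4636.24 = 9272.48
]
flags = [total_matches(*row) for row in rows]
```

The absolute tolerance of 0.01 reflects the two-decimal rounding used throughout the report; a different tolerance may suit the source system better.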

Empresa

  • The variable only takes one (non-missing) value: "ARPRO ARQUITECTOS INGENIEROS S.A.S". The variable contains 0 % missing observations.

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:33:47

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_FACT.SalidasAlmacen

The dataset examined has the following dimensions:

Feature Result
Number of observations 175811
Number of variables 18

Checks performed

The following variable checks were performed, depending on the data type of each variable:

  character factor labelled haven labelled numeric integer logical Date
Identify miscoded missing values × × × × × × ×
Identify prefixed and suffixed whitespace × × × ×
Identify levels with < 6 obs. × × × ×
Identify case issues × × × ×
Identify misclassified numeric or integer variables × × × ×
Identify outliers × × ×

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdEmpresa integer 1 0.00 % ×
SkIdProyecto integer 67 0.00 %
SkIdFechaSalida integer 3801 0.00 %
SkIdInsumo integer 6667 0.00 % ×
SkIdTercero numeric 524 3.12 % ×
SkIdOrigenDelDocumento integer 1 0.00 % ×
SkIdEstadoPorDocumento integer 2 0.00 %
SkIdItems integer 12573 0.00 % ×
SkIdBodega integer 67 0.00 %
Salida.Numero numeric 56568 0.00 %
Salida.Remision character 20174 0.00 % ×
Salida.Usuario character 65 0.00 % ×
Salida.Descuento character 2 0.00 %
Salida.Cantidad numeric 14862 0.00 % ×
Salida.Valor.Unitario numeric 29829 0.00 % ×
Salida.Valor.Total numeric 79492 0.00 % ×
Descuentos.Cantidad numeric 14870 0.00 % ×
Empresa character 1 0.00 % ×

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdProyecto

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 67
Median 100132
1st and 3rd quartiles 10029; 100210
Min. and max. 1003; 100294


SkIdFechaSalida

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 3801
Median 20181228
1st and 3rd quartiles 20140826; 20230512
Min. and max. 20110819; 20251031


SkIdInsumo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 6667
Median 1003993
1st and 3rd quartiles 1001404; 1008154
Min. and max. 100101; 10019220

  • Note that the following possible outlier values were detected: "100101", "100106", "100107", "100108", "100109", …, "10018959", "10018971", "10019101", "10019208", "10019220" (3266 values omitted).

SkIdTercero

Feature Result
Variable type numeric
Number of missing obs. 5486 (3.12 %)
Number of unique values 523
Median 11778
1st and 3rd quartiles 7292; 11778
Min. and max. 22; 12701

  • Note that the following possible outlier values were detected: "11938", "11944", "11951", "11959", "11966", …, "12600", "12620", "12627", "12628", "12701" (47 values omitted).
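
The summary table lists 524 unique values for SkIdTercero, while the table above lists 523. One plausible explanation, which the report does not state, is that the summary counts the missing value as one more distinct value. The Python sketch below illustrates that difference on hypothetical data and recomputes the missing percentage from the reported counts (5486 missing out of 175811 observations); the function names are illustrative only:

```python
def unique_counts(values):
    """Return (distinct values counting None, distinct values ignoring None)."""
    with_missing = len(set(values))
    without_missing = len({v for v in values if v is not None})
    return with_missing, without_missing


def missing_percent(n_missing, n_obs):
    """Share of missing observations as a percentage rounded to 2 decimals."""
    return round(100 * n_missing / n_obs, 2)


# Hypothetical column with two distinct ids and some missing entries.
counts = unique_counts([11778, 22, None, 11778, None])

# Counts taken from the report for SkIdTercero in this dataset.
pct = missing_percent(5486, 175811)
```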

SkIdOrigenDelDocumento

  • The variable only takes one (non-missing) value: "1". The variable contains 0 % missing observations.

SkIdEstadoPorDocumento

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “100120”
Reference category 100120


SkIdItems

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 12573
Median 10054420
1st and 3rd quartiles 10020359; 10076864
Min. and max. 100; 100144957

  • Note that the following possible outlier values were detected: "100", "1002499", "1002503", "1002504", "1002506", …, "100144594", "100144596", "100144774", "100144874", "100144957" (2931 values omitted).

SkIdBodega

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 67
Median 1000132
1st and 3rd quartiles 100029; 1000210
Min. and max. 10003; 1000294


Salida.Numero

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 56568
Median 1320073
1st and 3rd quartiles 290527.5; 21100605
Min. and max. 0; 174000235


Salida.Remision

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 20174
Mode “”

  • The following suspected missing value codes enter as regular values: "", " ", " ", ". 1553", ".1766", …, "888", "9", "99", "999", "9999" (4 values omitted).

  • The following values appear with prefixed or suffixed white space: " ", " ", " 2173", " 1256", " 0346", …, "CONTROL F ", "desc. ", "DEV ACERO ", "MAYO ", "mayo 31 " (60 values omitted).

  • Note that the following levels have at most five observations: " 1256", " 0346", " 957", " 0098", " 0318", …, "sc0465", "v", "vario", "Ver resume", "VS14061" (12851 values omitted).

  • Note that there might be case problems with the following levels: "acero", "Acero", "ACERO", "acta 4", "ACTA 4", …, "XX", "xxx", "XXX", "xxxx", "XXXX" (57 values omitted).
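
Case problems such as "acero", "Acero" and "ACERO" can be listed by grouping values under a case-insensitive key. The following is a minimal Python sketch, not the check dataMaid itself runs; case_variants is a hypothetical helper, and the sample reuses a few levels quoted above:

```python
from collections import defaultdict


def case_variants(values):
    """Group values that differ only by letter case; keep groups with several spellings."""
    groups = defaultdict(set)
    for value in values:
        groups[value.casefold()].add(value)
    return {key: sorted(spellings) for key, spellings in groups.items() if len(spellings) > 1}


sample = ["acero", "Acero", "ACERO", "acta 4", "ACTA 4", "vario"]
variants = case_variants(sample)
```

Each group can then be mapped to a single canonical spelling before the levels are counted again.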


Salida.Usuario

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 65
Mode “Julio Cesar Gomez Sanchez”

  • The following values appear with prefixed or suffixed white space: "Maria Mercedes Arias ".

  • Note that the following levels have at most five observations: "Carlos Alfonso Maury Maury", "Diego Urrego Perez", "Erick Jose Ocon Gomez", "Nubia Andrea Lara Palma".


Salida.Descuento

Feature Result
Variable type character
Number of missing obs. 0 (0 %)
Number of unique values 2
Mode “NO”


Salida.Cantidad

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 14862
Median 6
1st and 3rd quartiles 2; 40
Min. and max. 0; 2973948.8

  • Note that the following possible outlier values were detected: "677.8", "677.87", "678", "678.72", "679", …, "316878.9", "326823.44", "390811.69", "545650", "2973948.8" (6229 values omitted).

Salida.Valor.Unitario

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 29829
Median 10015.81
1st and 3rd quartiles 3195.8; 31571.27
Min. and max. -1160; 688343461.35

  • Note that the following possible outlier values were detected: "-1160", "309400", "309526.14", "309634.1", "309747.26", …, "413237307.12", "415068501.99", "688343461.35", "688343461.35", "688343461.35" (4030 values omitted).

Salida.Valor.Total

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 79492
Median 111860
1st and 3rd quartiles 19784.16; 661200
Min. and max. -638000; 14511842644.69

  • Note that the following possible outlier values were detected: "-638000", "10357454.88", "10358362.35", "10360043.52", "10362948.4", …, "1263989651.11", "1286132238.23", "1376686922.69", "2634857964.69", "14511842644.69" (6638 values omitted).

Descuentos.Cantidad

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 14870
Median 6
1st and 3rd quartiles 2; 40
Min. and max. -4; 2973948.8

  • Note that the following possible outlier values were detected: "-4", "668.64", "669", "669.15", "669.93", …, "316878.9", "326823.44", "390811.69", "545650", "2973948.8" (6259 values omitted).

Empresa

  • The variable only takes one (non-missing) value: "ARPRO ARQUITECTOS INGENIEROS S.A.S". The variable contains 0 % missing observations.

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:33:56

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


ADP_DTM_FACT.Traslados

The dataset examined has the following dimensions:

Feature Result
Number of observations 608
Number of variables 15

Checks performed

The following variable checks were performed, depending on the data type of each variable:

  character factor labelled haven labelled numeric integer logical Date
Identify miscoded missing values × × × × × × ×
Identify prefixed and suffixed whitespace × × × ×
Identify levels with < 6 obs. × × × ×
Identify case issues × × × ×
Identify misclassified numeric or integer variables × × × ×
Identify outliers × × ×

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdEmpresa integer 1 0.00 % ×
SkIdProyecto.Traslado integer 24 0.00 %
SkIdProyecto.Entrada integer 8 0.00 %
SkIdInsumo integer 373 0.00 % ×
SkIdFecha integer 81 0.00 %
SkIdEstadoPorDocumento integer 3 0.00 %
Numero.Traslado integer 145 0.00 %
Cantidad.Traslado numeric 217 0.00 % ×
Valor.Unitario.Traslado numeric 482 0.00 % ×
Valor.Total.Traslado numeric 547 0.00 % ×
Numero.Entrada.Traslado numeric 23 72.37 % ×
Cantidad.Entrada.Traslado numeric 94 0.00 % ×
Unitario.Entrada.Traslado numeric 138 0.00 % ×
Total.Entrada.Traslado numeric 165 0.00 % ×
Empresa character 1 0.00 % ×

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdProyecto.Traslado

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 24
Median 100184
1st and 3rd quartiles 10029; 100211
Min. and max. 1005; 100241


SkIdProyecto.Entrada

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 8
Median 100
1st and 3rd quartiles 100; 100217
Min. and max. 100; 100241


SkIdInsumo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 373
Median 1005025
1st and 3rd quartiles 1002279.5; 1008405
Min. and max. 100143; 10017507

  • Note that the following possible outlier values were detected: "100143", "100147", "100148", "100149", "100151", …, "10016565", "10016567", "10016568", "10016975", "10017507" (126 values omitted).

SkIdFecha

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 81
Median 20201013
1st and 3rd quartiles 20150528; 20230713
Min. and max. 20121113; 20251029


SkIdEstadoPorDocumento

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 3
Mode “100141”
Reference category 100140


Numero.Traslado

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 145
Median 212
1st and 3rd quartiles 90.75; 237
Min. and max. 3; 250


Cantidad.Traslado

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 217
Median 18
1st and 3rd quartiles 4; 148.5
Min. and max. 1; 32825

  • Note that the following possible outlier values were detected: "3000", "3003", "3115.12", "3164.17", "3196.7", …, "21590.23", "23007.43", "25526", "26312.71", "32825" (24 values omitted).
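
The report does not say which rule produced the outlier list. For comparison only, the Python sketch below computes classic Tukey fences (1.5 times the interquartile range) from the quartiles reported above; this is a swapped-in technique, not necessarily the one dataMaid applied, and tukey_fences is a hypothetical helper:

```python
def tukey_fences(q1, q3, k=1.5):
    """Return the (lower, upper) Tukey fences for the given quartiles."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# Quartiles reported for Cantidad.Traslado: 4 and 148.5.
low, high = tukey_fences(4, 148.5)
```

With these quartiles the upper fence is 365.25, whereas the smallest outlier listed in the report is 3000. Either no observations lie between those two values or the report applies a more conservative rule, so the listed outliers should not be read as a plain 1.5 IQR cut.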

Valor.Unitario.Traslado

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 482
Median 8731.95
1st and 3rd quartiles 2783; 29206.17
Min. and max. 20; 3153947.44

  • Note that the following possible outlier values were detected: "297540", "298922.05", "321300", "328797.36", "344288.38", …, "1195340.72", "1230460", "1895670", "2618000", "3153947.44" (22 values omitted).

Valor.Total.Traslado

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 547
Median 233468.47
1st and 3rd quartiles 52092.28; 1329719.03
Min. and max. 1013.63; 118986074.62

  • Note that the following possible outlier values were detected: "19712081.35", "19777536", "20082021.12", "20238490.38", "20499263.68", …, "65700738.2", "70539791.71", "86413153.68", "97631020.06", "118986074.62" (11 values omitted).

Numero.Entrada.Traslado

Feature Result
Variable type numeric
Number of missing obs. 440 (72.37 %)
Number of unique values 22
Median 209
1st and 3rd quartiles 203; 209
Min. and max. 189; 212

  • Note that the following possible outlier values were detected: "210", "211", "212".

Cantidad.Entrada.Traslado

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 94
Median 0
1st and 3rd quartiles 0; 1
Min. and max. 0; 26312.71

  • Note that the following possible outlier values were detected: "36", "40", "48", "50", "52", …, "6710.69", "14518.63", "15587.97", "21590.23", "26312.71" (62 values omitted).

Unitario.Entrada.Traslado

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 138
Median 0
1st and 3rd quartiles 0; 3289.76
Min. and max. 0; 1195340.72

  • Note that the following possible outlier values were detected: "210011.2", "238000", "256399.78", "344288.38", "892356.6", "1195340.72".

Total.Entrada.Traslado

Feature Result
Variable type numeric
Number of missing obs. 0 (0 %)
Number of unique values 165
Median 0
1st and 3rd quartiles 0; 25971.75
Min. and max. 0; 118986074.62

  • Note that the following possible outlier values were detected: "879760.94", "949694.68", "1195340.72", "1244145", "1274368.91", …, "53541396.3", "65700738.2", "70539791.71", "97631020.06", "118986074.62" (55 values omitted).

Empresa

  • The variable only takes one (non-missing) value: "ARPRO ARQUITECTOS INGENIEROS S.A.S". The variable contains 0 % missing observations.

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:34:34

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)


eda_Proyeccion

The dataset examined has the following dimensions:

Feature Result
Number of observations 275143
Number of variables 19

Checks performed

The following variable checks were performed, depending on the data type of each variable:

  character factor labelled haven labelled numeric integer logical Date
Identify miscoded missing values × × × × × × ×
Identify prefixed and suffixed whitespace × × × ×
Identify levels with < 6 obs. × × × ×
Identify case issues × × × ×
Identify misclassified numeric or integer variables × × × ×
Identify outliers × × ×

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table

  Variable class # unique values Missing observations Any problems?
SkIdEmpresa integer 1 0.00 % ×
SkIdProyecto integer 77 0.00 %
SkIdCapitulo integer 1389 0.00 %
SkIdItems integer 34007 0.00 % ×
SkIdInsumo integer 11238 0.00 % ×
SkIdReforma logical 1 100.00 % ×
SkIdUsuario integer 27 0.00 % ×
SkIdFecha integer 2785 0.00 % ×
SkIdFecha.Real integer 2803 0.00 %
SkIdEstado integer 3 0.00 %

Variable list

SkIdEmpresa

  • The variable only takes one (non-missing) value: "100". The variable contains 0 % missing observations.

SkIdProyecto

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 77
Median 100157
1st and 3rd quartiles 10035; 100225
Min. and max. 1003; 100295


SkIdCapitulo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 1389
Median 1001572385
1st and 3rd quartiles 100291683; 1002253214
Min. and max. 100346; 1002954534


SkIdItems

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 34007
Median 10057210
1st and 3rd quartiles 10026866; 10084837
Min. and max. 1002462; 100145631

  • Note that the following possible outlier values were detected: "1002462", "1002463", "1002464", "1002499", "1002503", …, "100145626", "100145628", "100145629", "100145630", "100145631" (9125 values omitted).

SkIdInsumo

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 11238
Median 1005999
1st and 3rd quartiles 1002291; 1008656.5
Min. and max. 100101; 10019349

  • Note that the following possible outlier values were detected: "100101", "100106", "100107", "100108", "100109", …, "10019345", "10019346", "10019347", "10019348", "10019349" (6028 values omitted).

SkIdReforma

  • The variable only takes one value: "NA".

SkIdUsuario

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 27
Median 100140
1st and 3rd quartiles 100140; 100370
Min. and max. 100; 100513

  • Note that the following possible outlier values were detected: "100", "10068", "10069", "10086", "100103", "100128".

SkIdFecha

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 2785
Median 20190521
1st and 3rd quartiles 20151020; 20231011
Min. and max. 19000101; 20251031

  • Note that the following possible outlier values were detected: "19000101".

SkIdFecha.Real

Feature Result
Variable type integer
Number of missing obs. 0 (0 %)
Number of unique values 2803
Median 20190524
1st and 3rd quartiles 20151025; 20231011
Min. and max. 20110929; 20251031


SkIdEstado

  • Note that this variable is treated as a factor variable below, as it only takes a few unique values.

Report generation information:

  • Created by: RamiroSeb (username: SEBASTIAN).

  • Report creation time: Sun Nov 02 2025 14:12:48

  • Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review

  • dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]

  • R version 4.5.0 (2025-04-11 ucrt).

  • Platform: x86_64-w64-mingw32/x64(America/Bogota).

  • Function call: makeDataReport(data = df, output = "html", file = "20251102/reporte_eda_Proyeccion", replace = TRUE, openResult = FALSE)